ENZYMATIC METHOD FOR
MODIFICATION OF RECOMBINANT POLYPEPTIDES 


Background of the Invention 
Many naturally occurring proteins and peptides
have been produced by recombinant DNA techniques.
Recombinant DNA techniques have made possible the
selection, amplification and manipulation of expression
of the proteins and peptides. For example, changes in
the sequence of the recombinantly produced proteins or
peptides can be accomplished by altering the DNA
sequence by techniques like site-directed or deletion
mutagenesis.

However, some modifications to a recombinantly
produced protein or peptide cannot be accomplished by
altering the DNA sequence. For example, the C-terminal
α-carboxyl group in many naturally occurring proteins and
peptides often exists as an amide, but this amide
typically is not produced through recombinant expression
and is biologically converted after expression in vivo
from a precursor protein to the amide.

A method of forming a C-terminal amide on a
recombinantly produced polypeptide by the action of an
enzyme is known. The enzyme is peptidyl glycine
α-amidating monooxygenase and is present in eukaryotic
systems. The enzyme has been used to form an amide on
the C-terminal amino acid of recombinantly produced
peptides, like human growth hormone releasing hormone, in
vitro, as described by J. Engels, Protein Engineering,
1:195-199 (1987). While effective, the enzymatic method
is time consuming and expensive, gives unpredictable
yields, and requires significant post-reaction
purification. The enzymatic method is also limited to
modifying the recombinantly produced peptide by
terminal amidation.

Accordingly, there is a need for a chemical
method that provides for modification of C-terminal
α-carboxyl groups of a recombinantly produced
polypeptide. There is also a need for a method of
modification that allows addition of a variety of
moieties to the C-terminal α-carbon reactive groups of a
recombinantly produced polypeptide and that is
convenient, cheap and capable of producing terminally
modified recombinant polypeptides in high yield.
Therefore, it is an object of the invention to develop a
biochemical method for selective modification of the
C-terminal amino acid of a recombinantly produced
polypeptide. A further object is to provide a simple
and economic method for modification of the C-terminal
amino acid through a transpeptidation reaction.

Summary of the Invention

These and other objects are accomplished by the
present invention. The present invention is an
economical biochemical method for modification of the
C-terminal amino acid of recombinant polypeptides to
provide polypeptides which cannot normally be obtained
through recombinant technology.

The process of the invention utilizes
transpeptidation, which involves contacting an
endopeptidase enzyme, specific for an enzyme cleavage
site, with a recombinant polypeptide, composed of at
least one core linked by a cleavage site to a leaving
unit, in the presence of an addition unit. The
endopeptidase enzyme cleaves the leaving unit from the
core at the cleavage site and simultaneously causes the
core and the addition unit to form the desired modified
recombinant polypeptide. Alternatively, the cleavage of
the leaving unit and the formation of the linkage
between the core and addition unit may be completed in
two separate steps. Subsequent to transpeptidation,
further enzymatic modification of the terminal amino
acid carboxy group of the addition unit, through known
enzymatic methodology, is possible.

The endopeptidase enzymes used according to the
method of the invention include those of the serine or
cysteine peptidase class. The endopeptidase enzymes
trypsin and thrombin, of the serine peptidase class, are
especially desirable endopeptidase enzymes to serve as
cleavage enzymes for the method of the invention.

The recombinant polypeptide starting material
includes a core which may be a truncated version of its
natural form. The core may be truncated through
deletion of amino acids at either, or both, of its
C-terminal and N-terminal ends, depending on the product
desired. The recombinant polypeptide also includes a
leaving unit linked to the core by an enzyme cleavage
site recognized by the endopeptidase enzyme. The
leaving unit may be one or more amino acid residues.

The amino acid cleavage site for the
endopeptidase enzyme may be recognized by the
endopeptidase enzyme in solo or as a part of a multiple
amino acid recognition sequence. In addition, according
to the method of the invention, cleavage sites which are
normally cleaved by an endopeptidase enzyme may be
rendered less reactive or unrecognizable when adjacent
to certain other amino acids. This knowledge is used
advantageously to make some cleavage sites less
reactive, which renders new and substantial utility to
endopeptidase enzymes which may otherwise be precluded
from use in certain transpeptidation reactions. The
ability to cause combination of the addition unit with
the core is a desirable characteristic of the
endopeptidase enzyme. The addition unit may be one or
more amino acid residues which may be modified at the
C-terminal α-carboxy at the time of transpeptidation, or
may be further treated by known enzymatic methodologies
subsequent to transpeptidation.

The entire transpeptidation process may be done
in a single step under very mild conditions. The
starting polypeptide of the invention may be a
single-copy recombinant polypeptide, a multi-copy recombinant
polypeptide or a single or multi-copy recombinant fusion
protein construct. The number and sequence of steps of
cleaving and reacting the starting material can vary
depending on the starting material used.

The recombinant multicopy polypeptide may be
multiple copies of the single copy polypeptide linked
together with or without intraconnecting peptides. If
an intraconnecting peptide is present, it has at least
one site that is selectively cleavable by the
endopeptidase cleavage enzyme. The intraconnecting
peptide may also serve as the leaving group from the
C-terminal end of a single copy core polypeptide.

The single copy polypeptides within a multicopy
polypeptide may be linked directly to each other through
an amino acid linkage recognized by the endopeptidase
cleavage enzyme. According to this method of the
invention, it is preferred that a multicopy recombinant
polypeptide is cleaved into single copy core units and
simultaneously transpeptidated when in the presence of a
suitable addition unit. The downstream core acts as a
leaving group for the core immediately preceding it.
The terminal single copy core of a multicopy recombinant
polypeptide is linked to a suitable leaving unit so that
all single-copy polypeptides within the multicopy
recombinant polypeptide are transpeptidated according to
the method of the invention.

A fusion protein construct has three
tandemly-linked segments including a binding protein connected
via an interconnecting peptide to a single copy or
multicopy polypeptide. The interconnecting peptide has
at least one site that is selectively cleavable by a
chemical or enzymatic method. The binding protein with
the interconnecting peptide acts as a biological
protecting group and aids in the purification of the
recombinant multicopy polypeptide.

Detailed Description of the Invention

Recombinant DNA techniques have made possible
the selection, amplification, and manipulation of
expression of many naturally occurring proteins and
peptides. It is often desirable to selectively modify a
recombinant polypeptide at the N-terminal α-amine and/or
C-terminal α-carboxyl groups. For example, the
C-terminal reactive carboxyl groups in some naturally
occurring proteins and peptides can be selectively
converted to an amide to provide for enhancement of
biological activity. Alternatively, a D-amino acid or
peptide could be added to replace a terminal amino acid.

These modifications can result in the formation
of analogs of the recombinantly produced polypeptide
that are longer acting and more potent than the
naturally occurring polypeptide. Generally, these types
of modifications to the recombinantly produced
polypeptide are not accomplished by alteration of the
DNA sequence for the recombinantly produced polypeptide
because there is no genetic code providing for amino
acid amides, or incorporation of a D-amino acid or an
amino acid derivative.

The present invention provides a process for
the selective modification of a recombinantly produced
polypeptide by a single-step transpeptidation process at
cleavage sites specific for various cleavage enzymes.
Alternatively, a two-step transpeptidation process may
be used whereby the polypeptide is first enzymatically
cleaved at the cleavage site to form the hydrolysis
product, which is then condensed with a suitable
addition unit to form the modified recombinant
polypeptide product.

The process allows for efficient modification
of recombinant polypeptides to produce products for
which there is no genetic code, for example, C-terminal
α-carboxyl amidation.



PROCESS 

The process provides for modification of a
recombinant polypeptide through transpeptidation. For
purposes of this invention, "transpeptidation" is
defined as that process whereby a terminal amino acid or
a chain of amino acids (leaving unit), linked through an
endopeptidase enzyme cleavage site at the C-terminal end
of a recombinant polypeptide, is replaced by another
amino acid or chain of amino acids (addition unit), in
the presence of an endopeptidase cleavage enzyme. The
method of the invention utilizes an endopeptidase
enzyme, preferably of the serine or cysteine class, as
the cleavage enzyme to catalyze the transpeptidation
process.
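The definition above can be pictured as a simple operation on a residue list. The following is a minimal sketch for illustration only, not part of the disclosed method: the function name `transpeptidate` and the representation of a polypeptide as a list of three-letter codes are assumptions made here. The residues are those of the GLP1 (7-34) core with its Ala-Phe-Ala leaving unit described later in this specification.

```python
# Illustrative sketch: a polypeptide is modelled as a list of three-letter
# residue codes. The function name and data layout are hypothetical.

def transpeptidate(substrate, core_length, addition_unit):
    """Swap the leaving unit (everything after the core) for the addition unit.

    Returns a (product, leaving_unit) pair.
    """
    core = substrate[:core_length]
    leaving_unit = substrate[core_length:]
    if not leaving_unit:
        raise ValueError("substrate carries no leaving unit after the core")
    return core + list(addition_unit), leaving_unit


# GLP1 (7-34) core followed by the Ala-Phe-Ala leaving unit.
CORE = ("His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-"
        "Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys").split("-")
substrate = CORE + ["Ala", "Phe", "Ala"]

# The addition unit Gly-Arg-NH2; the amidated Arg is kept as one token.
product, leaving_unit = transpeptidate(substrate, len(CORE), ["Gly", "Arg-NH2"])
print("-".join(product[-4:]))     # last four residues of the product
print("-".join(leaving_unit))     # released leaving unit
```

The sketch captures only the bookkeeping of which residues leave and which are added; it says nothing about the enzyme chemistry or the yield.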

The recombinant polypeptide includes a core 
portion and a leaving unit. The core is any useful 
polypeptide sequence such as a native sequence, a 
modified native sequence, a non-native sequence having 
biological activity, truncated forms thereof and
similar versions. The leaving unit is one or more 
amino acid units. Preferably, the leaving unit is 
linked to the core through an amino acid linkage which 
is recognized as a cleavage site by the endopeptidase 
cleavage enzyme. According to the method of the 
invention, the core polypeptide linked to a leaving unit 
may be derived from any source including chemical 
synthesis, recombinant single copy polypeptide 
expression, multicopy polypeptide expression, or single 
or multicopy fusion protein constructs. 

The recombinant polypeptide is contacted with 
at least one endopeptidase cleavage enzyme specific for 
at least one cleavage site. The enzymatic cleavage of 
the recombinant polypeptide at the linkage of the core 
portion and the leaving unit is conducted in the 
presence of an addition unit. An addition unit is a
single or multiple amino acid residue which may be
modified at its C-terminal α-carbon. Alternatively,
modification of the C-terminal α-carbon end of the
addition unit may be done subsequent to the
transpeptidation process.

The method of the invention also provides for 
cleavage of recombinant multicopy polypeptides into 
single copy polypeptides by the endopeptidase cleavage 
enzyme. Under the method of the invention, in the 
presence of a suitable leaving unit, cleavage of the 
multicopy polypeptide will occur simultaneously with 
single-step transpeptidation. Alternatively, the 
polypeptide may be cleaved, at the cleavage site, to 
form the hydrolyzed cleaved polypeptide which 
subsequently undergoes condensation with the addition 
unit to form the modified recombinant polypeptide 
product.

I. Transpeptidation 

The method of the invention provides a modified 
recombinant polypeptide product produced by 
transpeptidation of a recombinant polypeptide. The 
sequence and number of steps in the method of the 
invention can be varied depending upon the desired 
modification of the recombinant polypeptide, the amino 
acid sequence of the desired product peptide, and the 
starting material selected. The transpeptidation method 
of the invention calls for the recombinant polypeptide 
to be contacted with an endopeptidase cleavage enzyme, 
which has specific cleavage activity at the linkage 
between the core and the leaving unit. 

The endopeptidase cleavage enzyme cleaves the
leaving unit from the carboxy terminal of the core of
the recombinant polypeptide. Although it is not
intended to be a limitation of the invention, it is
believed that during this cleavage, the enzyme forms an
acyl- or thioacyl-enzyme intermediate with the core. In
the presence of an appropriate addition unit, under
proper conditions, the enzyme causes the addition unit
to add to the cleaved core. Although it is not intended
to be a limitation of the invention, it is believed that
to accomplish this combination, the addition unit
displaces the cleavage enzyme from the acyl-enzyme
intermediate and links to the core polypeptide where the
leaving unit was linked. The production of the modified
recombinant polypeptide is monitored by HPLC or other
analytic procedure and the reaction is stopped by the
addition of an acidic solution when the reaction has
reached completion. The amino acid or terminal amino
acid residue of the addition unit may already be
modified at its carboxy terminal end at the time of
undergoing the transpeptidation reaction, such as by
modification of the C-terminus carboxylic acid to a
carboxamide, or, alternatively, be modified after
formation of the modified recombinant polypeptide.

According to the method of the invention,
preferably, the cleavage site recognized by the cleavage
enzyme is a site not duplicated in the core or is not at
an enzyme accessible site within the core. The method
of the invention is also directed to an endopeptidase
enzyme cleavage of a multicopy recombinant polypeptide
into single copy recombinant polypeptides and
simultaneously transpeptidating the cores to form the
modified recombinant polypeptide in a single biochemical
reaction.

The invention is further directed to modified
enzyme cleavage sites which, when adjacent to certain
amino acid residues, render the site unrecognizable or
less reactive to cleavage. The discovery of the use of
these unrecognizable or less reactive sites to prevent
cleavage renders new and substantial utility to various
cleavage enzymes which would otherwise be precluded from
use in certain transpeptidation reactions due to the
detrimental effect of cleavage of recombinant
polypeptides at sites within the desired core.

The leaving units to be cleaved from the core
are specifically chosen to provide a suitable leaving
unit for the specific endopeptidase cleavage enzyme.
The addition units are chosen to provide the amino acid
or peptide chain to complete formation of the desired
modified recombinant polypeptide. The amino acid or
terminal amino acid of the addition unit may be modified
at the C-terminal α-carboxy or it may be modified after
transpeptidation. Alternatively, the addition unit may
be a peptidomimetic and serve as a linker between the
core and attached functional unit, as disclosed in
co-pending patent application Serial No. .

The cleavage enzymes, according to the method
of the invention, include the class of endopeptidases.
The endopeptidases suitable for use in the present
invention include the serine and cysteine peptidases.
Although it is not intended to be a limitation of the
invention, the mechanism of action of serine and
cysteine endopeptidases is believed to involve the
formation of an acyl- or thioacyl-enzyme intermediate
with the core after cleaving the leaving unit. Under
appropriate reaction conditions, it is believed that the
addition unit acts as a nucleophile and displaces the
endopeptidase cleavage enzyme from the acyl- or
thioacyl-enzyme intermediate.

Serine peptidases are a group of animal,
plant and bacterial endopeptidases which have a
catalytically active serine residue in their active
center. Representative examples of endopeptidases of
the serine peptidase classification include trypsin,
thrombin, chymotrypsin, enterokinase, subtilisin, and
factor Xa. Representative examples of the cysteine
peptidase classification include ficin and papain.

The endopeptidase trypsin is found in the
pancreas of all vertebrates. It is released via the
pancreatic duct into the duodenum as trypsinogen.
Conversion of trypsinogen into trypsin is initiated in
the small intestine by enterokinase. Natural or
synthetic forms of trypsin are suitable for the method
of the invention.

Trypsin is known for its pronounced cleavage
site specificity, catalyzing hydrolysis of only the
carboxyl end of the -Lys-X and -Arg-X bonds. Trypsin's
affinity for cleavage at the -Lys-X bond is
significantly diminished when the lysine is immediately
adjacent to an amino acid containing a carboxylic acid
side chain, specifically including the amino acids
glutamic acid and aspartic acid (i.e., X = glutamic or
aspartic acid). A discovery of the present invention
utilizes the knowledge of decreased cleavage activity at
the -Lys-X cleavage sites when X is an amino acid
containing a carboxylic acid side chain. This
discovery has rendered the endopeptidase trypsin of
great utility in the formation of modified recombinant
polypeptides, according to the method of the invention.
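The specificity rule just stated lends itself to a short scan over a sequence. The sketch below is illustrative only: the helper name `trypsin_sites`, the `first_position` parameter, and the choice to flag a site simply as reactive or poorly reactive are assumptions made here, not terms of the specification. It marks every -Lys-X and -Arg-X bond and flags a -Lys-X bond as poorly reactive when X is Glu or Asp.

```python
# Illustrative sketch of the trypsin rule described above.
# Names and the boolean "reactive" flag are hypothetical simplifications.

ACIDIC = {"Glu", "Asp"}  # residues with a carboxylic acid side chain


def trypsin_sites(residues, first_position=1):
    """Return (position, reactive) for each Lys or Arg that is followed by X.

    `position` uses the numbering of the first residue given by
    `first_position`; `reactive` is False for Lys followed by Glu or Asp.
    """
    sites = []
    for i, residue in enumerate(residues[:-1]):
        if residue in ("Lys", "Arg"):
            following = residues[i + 1]
            reactive = not (residue == "Lys" and following in ACIDIC)
            sites.append((first_position + i, reactive))
    return sites


# GLP1 (7-34) core linked to an Ala-Phe-Ala leaving unit, numbered from 7.
glp1_substrate = ("His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-"
                  "Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-"
                  "Ala-Phe-Ala").split("-")

sites = trypsin_sites(glp1_substrate, first_position=7)
print(sites)
```

For this substrate the scan reports Lys 26 (followed by Glu) as poorly reactive and Lys 34 (followed by Ala) as reactive, which matches the selective cleavage the text relies on.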

The glycoprotein endopeptidase thrombin, also
of the serine peptidase classification, is responsible
for the conversion of fibrinogen to fibrin. It is
naturally produced during blood coagulation by the
action of factor Xa upon prothrombin. This endopeptidase
has considerable sequence homology with trypsin and
contains the catalytically important residues His, Asp,
and Ser in the B chain. Thrombin has a cleavage
specificity for the carboxy end of the -Arg- cleavage
site in specific peptide sequences known as recognition
sequences. Natural or synthetic forms of thrombin are
suitable for the method of the invention.

Thrombin is known for its cleavage site
specificity at the carboxyl side of the -Arg- residue;
the known recognition sequence for the -Arg- cleavage
site is Gly-Pro-Arg-. A discovery of the present
invention is that thrombin also cleaves at the carboxyl
side of -Arg- within the recognition sequence
Gly-Ala-Arg. This discovery enhances the use of thrombin for
transpeptidation by the method of this invention as well
as other synthetic reactions where knowledge of the
Gly-Ala-Arg recognition sequence will be of benefit.
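The two recognition sequences named above can be located with a sliding three-residue window. This is a minimal sketch under stated assumptions: the helper name `thrombin_sites` and the example peptide are hypothetical and chosen only to exercise the search, and the sketch considers no context beyond the three residues named in the text.

```python
# Illustrative sketch: find Arg residues that close a Gly-Pro-Arg or
# Gly-Ala-Arg motif. The helper name and the test peptide are hypothetical.

THROMBIN_MOTIFS = {("Gly", "Pro", "Arg"), ("Gly", "Ala", "Arg")}


def thrombin_sites(residues):
    """Return 0-based indices of Arg residues ending a recognised motif.

    Cleavage would occur on the carboxyl side of each returned residue.
    """
    return [i for i in range(2, len(residues))
            if tuple(residues[i - 2:i + 1]) in THROMBIN_MOTIFS]


# Hypothetical peptide containing both motifs and one unrelated Arg.
peptide = "Leu-Val-Gly-Pro-Arg-Ser-Gly-Ala-Arg-Ala-Arg-Thr".split("-")
hits = thrombin_sites(peptide)
print(hits)
```

In the hypothetical peptide the Arg at index 10 is not reported because it is preceded by Arg-Ala rather than Gly-Pro or Gly-Ala.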

A. Transpeptidation Using the Endopeptidase
Trypsin

The transpeptidation process according to the
method of the invention may be accomplished using
starting recombinant polypeptide derived from single or
multicopy constructs, or single or multicopy fusion
protein constructs.

1. Trypsin Transpeptidation of a
Single Copy Recombinant Polypeptide

The following description is based upon a
particular recombinantly-derived core starting
polypeptide; however, it is understood that the method
of the invention is suitable for transpeptidation of
polypeptides, regardless of the source.

The transpeptidation process of the invention
is preferably a one-step reaction conducted in a buffer
solution capable of maintaining pH at about pH 2-13,
preferably pH 3-12, and more preferably pH 5-11.
Suitable buffers for the present invention include Tris,
succinate, citrate, phosphate, acetate, tricine,
HEPES, and the like. In one embodiment of the invention
using the serine endopeptidase trypsin as the cleavage
enzyme for the transpeptidation method of the invention,
the modified recombinant polypeptide, for example
Glucagon-like Peptide 1 (GLP1) (7-36)-NH2, is produced.
The product GLP1 (7-36)-NH2 is produced in several
tissues and has been shown to be an incretin, and is
commonly referred to as GLIP. The sequence of GLP1
(7-36)-NH2 (SEQ ID NO:1) is:

His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-
7
Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-
Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-
        26  27
Val-Lys-Gly-Arg-NH2
    34      36

According to the invention, the trypsin
catalyzed transpeptidation reaction, causing
substitution of the addition unit for the leaving unit,
is in competition with the trypsin catalyzed hydrolysis
at the carboxy terminus of the amino acid at the
cleavage site. There are two ways to affect the
reaction mixture to favor the transpeptidation process.
In the first, the reaction is conducted in an aqueous
buffer solution with reactant concentrations conducive
to the transpeptidation process. Alternatively, organic
solvents may be used to favor the transpeptidation
process over hydrolysis.
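The competition described above can be illustrated with a simple partition model. The model is an assumption introduced here and is not taken from the specification: it treats capture of the acyl-enzyme intermediate by the addition unit as second order in addition unit concentration and hydrolysis as pseudo-first order, and every rate constant in the example is a made-up number chosen only to show the trend.

```python
# Illustrative sketch, assuming a simple two-pathway partition of the
# acyl-enzyme intermediate. The model, the function name and all numeric
# values are hypothetical.

def transpeptidation_fraction(addition_unit_molar, k_transfer, k_hydrolysis):
    """Fraction of acyl-enzyme intermediate captured by the addition unit.

    k_transfer   : assumed second-order rate constant, 1/(M*s)
    k_hydrolysis : assumed pseudo-first-order rate constant, 1/s
    """
    transfer_rate = k_transfer * addition_unit_molar
    return transfer_rate / (transfer_rate + k_hydrolysis)


# Raising the addition unit concentration shifts the balance toward
# transpeptidation (hypothetical constants).
low = transpeptidation_fraction(0.1, k_transfer=10.0, k_hydrolysis=1.0)
high = transpeptidation_fraction(1.0, k_transfer=10.0, k_hydrolysis=1.0)
print(round(low, 3), round(high, 3))
```

Under this assumed model, lowering the effective hydrolysis rate, as a largely organic solvent system might, has the same qualitative effect as raising the addition unit concentration.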

In the first variation, the recombinant
polypeptide GLP1 (7-34) core linked to the leaving unit
-Ala-Phe-Ala at a -Lys- cleavage site (SEQ ID NO:2):

His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-
7
Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-
Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-
        26  27
Val-Lys-Ala-Phe-Ala
    34

is dissolved in buffer. To the transpeptidation
mixture is added the suitable addition unit, containing
the desired amino acid or peptide sequence. The amount of
addition unit required is dependent on the dissociation
constant (K_M) of the endopeptidase-acyl intermediate to
the recombinant polypeptide and the concentration of
recombinant polypeptide in the mixture. Typically, the
amount of addition unit is about one equivalent to 20
times the K_M of the addition unit to the acyl-enzyme
intermediate, preferably 10 x K_M of the addition unit to
the acyl-enzyme intermediate. For example, Gly-Arg-NH2
or Gly-Arg-Gly are desired sequences which are suitable
addition units for synthesis of the modified recombinant
polypeptide product GLIP (SEQ ID NO:1) and GLP1
(7-36)-Gly (SEQ ID NO:3),

His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-
7
Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-
Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-
        26  27
Val-Lys-Gly-Arg-Gly
    34      36

respectively. The cleavage enzyme trypsin is added in
an effective catalytic amount but not so great as to
cause substantial secondary reactions such as cleavage
at other sites, hydrolysis, and the like. The cleavage
enzyme trypsin is added to the reaction mixture in a
trypsin:polypeptide molar ratio of about 1:10 to
1:500,000, preferably 1:100 to 1:100,000, and more
preferably 1:200 to 1:50,000.
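The quantities above reduce to two small calculations, sketched here for illustration. The function names are hypothetical, the dissociation constant used in the example is an invented placeholder rather than a measured value, and the sketch reads the addition unit guidance as a multiple of that dissociation constant.

```python
# Illustrative sketch of the reagent arithmetic described above.
# Function names and the example dissociation constant are hypothetical.

def trypsin_moles(polypeptide_moles, ratio_denominator):
    """Moles of trypsin for a 1:ratio_denominator trypsin:polypeptide ratio."""
    if not 10 <= ratio_denominator <= 500_000:
        raise ValueError("outside the broadest stated range, 1:10 to 1:500,000")
    return polypeptide_moles / ratio_denominator


def addition_unit_molar(dissociation_constant_molar, multiple=10.0):
    """Addition unit concentration as a multiple of the dissociation constant.

    The text prefers a multiple of 10 and gives 20 as the upper figure.
    """
    if not 0 < multiple <= 20:
        raise ValueError("multiple should be above 0 and at most 20")
    return multiple * dissociation_constant_molar


# 1 micromole of polypeptide at a 1:200 ratio, and a placeholder
# dissociation constant of 5 mM.
enzyme = trypsin_moles(1e-6, 200)
nucleophile = addition_unit_molar(5e-3)
print(enzyme, nucleophile)
```

Raising an error for a ratio outside 1:10 to 1:500,000 is a design choice of this sketch; the text describes those bounds as approximate.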

The production of the modified recombinant
polypeptide product GLIP or GLP1 (7-36)-Gly is monitored
by HPLC, laser desorption, mass spectrometry, or other
analytical method, and the reaction is stopped by the
addition of an acid solution. The reaction procedure
may be stopped by an acid solution at about pH 3.
Suitable acid solutions for stopping the reaction
include hydrochloric, sulfuric, and acetic acid, and the
like.

Alternatively, the trypsin catalyzed
competitive reactions of hydrolysis and transpeptidation
may be shifted in favor of transpeptidation through the
use of organic solvents. Suitable solvents for the
transpeptidation reaction, according to the method of
the invention, include DMSO at 75% v:v and
N,N'-dimethylacetamide at 95% v:v. Bongers et al.,
Int. J. Peptide Protein Res., 40:268 (1992).

If the desired modified recombinant polypeptide
product requires an amidated C-terminal amino acid but
an addition unit including a non-amidated terminal amino
acid was used, the C-terminal α-carboxyl group may be
amidated in a further step. The C-terminal α-carboxyl
group may be amidated as described by Bongers et al.,
cited supra, for the GLP1 (7-36)-Gly by the use of the
C-terminal α-carboxyl amidating enzyme, as described in
Henriksen et al., J. Am. Chem. Soc., 114:1876-1877
(1992); and Ohsuye et al., Biochem. Biophys. Res.
Commun., 150:1275 (1988). The foregoing references
describe the procedure and are incorporated herein by
reference.

The modified recombinant polypeptide product is
purified from the mixture by HPLC, ion exchange
chromatography, hydrophobic interaction chromatography,
or particle exclusion chromatography. To further reduce
contamination, the separated product may be further
purified by sequential use of the aforementioned methods.

The recombinant polypeptide product may be used
immediately or may be stored by lyophilization and
cryopreservation at about -70°C.

In this variation, the endopeptidase cleavage
enzyme trypsin cleaved the truncated GLP1 core from the
leaving unit at the 34-35 Lys-Ala cleavage site. (See
SEQ ID NO:2). As stated earlier, trypsin is known for
its cleavage site specificity at -Lys-X and -Arg-X
bonds. It is noted that the GLP1 (7-34) core also
contains the trypsin cleavage site -Lys- at the 26-27
amino acid position. This lysyl was not cleaved by
trypsin. It is believed that the amino acid adjacent to
-Lys-, which contains a carboxylic acid side chain,
rendered the -Lys- cleavage site less reactive. The
method of the invention utilizes the knowledge that
-Lys- followed by an amino acid with a carboxylic acid
containing side chain is a poor substrate, and the
discovery that lysyl-glutamyl at 27-28 is not hydrolyzed
during the time required for complete hydrolysis of
lysyl-histidyl at 6-7 and lysyl-glycyl at 34-35 of GLP1
(1-37). For example, -Glu- renders the lysyl cleavage
site in -Lys-Glu- a poor substrate for trypsin. This
allows the serine peptidase trypsin to be utilized as a
cleavage enzyme when there exist multiple recognized
-Lys- cleavage sites, but only the desired cleavage site
is not adjacent to an amino acid containing a carboxyl
group containing side chain.

2. Trypsin Transpeptidation of a
Single Copy Recombinant Polypeptide
Derived from a Fusion Protein Construct

The GLIP and GLP1 (7-36)-Gly modified
recombinant product polypeptides may be produced by the
method of the invention starting with a recombinant
polypeptide derived from a recombinant single copy
fusion protein starting product. As discussed infra, a
fusion protein construct serves as a carrier protein
system for recombinant polypeptides and provides an
efficient system for chromatographic purification. In
this variation, the fusion protein construct is first
purified from the other cell constituents, as described
below at section II. (See this section also for
definitions of the fusion protein terms.) Once the
fusion protein construct is purified from the other cell
constituents, preferably, the binding protein is
separated from the recombinant single copy polypeptide.
According to the method of the invention, the separation
of the binding protein from the recombinant single copy
polypeptide is accomplished by cleavage of the
interconnecting polypeptide or amino acid. Depending on
the interconnecting polypeptide or amino acid used,
cleavage may be accomplished by the use of a cleavage
enzyme or chemical cleavage reagent. For example, the
chemical cleavage agent cyanogen bromide (CNBr) in 70%
formic acid cleaves at the interconnecting amino acid
methionine. Once the single copy polypeptide is
released from the binding protein, it is separated from
the binding protein by methods known in the art such as
precipitation and chromatographic procedures including
size exclusion, ion exchange, HPLC, and the like. Once
purified, the GLP1 (7-34)-Ala-Phe-Ala is transpeptidated
according to the method of the invention, as described
above.

3. Trypsin Transpeptidation of a
Multicopy Recombinant Polypeptide

In a third variation, according to the method
of the invention, a recombinant multicopy polypeptide is
cleaved into recombinant single copy polypeptides
simultaneously with the transpeptidation process. The
number of single copy polypeptides which may be
incorporated into a recombinant multicopy polypeptide is
limited only by the physical capabilities of the
specific expression system selected for expression of
the recombinant multicopy polypeptide. The recombinant
multicopy polypeptide may include multiple single copy
core polypeptides with intraconnecting peptides between
individual single copy core polypeptides, or the single
copy core polypeptides may be linked directly to one
another. In either alternative, the linkage between the
intraconnecting polypeptide and core, or between
directly linked individual cores, is preferably a
cleavage site recognized by the endopeptidase cleavage
enzyme. Further, in either variation of a multicopy
polypeptide, only the terminal single copy polypeptide
need be linked to a leaving unit. In the non-terminal
single copy recombinant polypeptides, the downstream
polypeptide acts as a leaving group for the immediately
preceding polypeptide.

According to the method of the invention, when
the recombinant multicopy polypeptide is composed of
single copy core polypeptides linked by intraconnecting
peptides, the only requirement for a peptide used as an
intraconnecting peptide is that its terminal
ends are composed of amino acids which will not inhibit
the cleavage activity of the endopeptidase cleavage
enzyme at the linkage. For example, in one variation,
the 1-6 amino acid sequence of the unprocessed natural
form of GLP1 may serve as an intraconnecting polypeptide
between individual single copy GLP1 (7-34) units (SEQ ID
NO:4):

His-Asp-Glu-Phe-Glu-Arg-His-Ala-
1                       7
Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-
Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-
Lys-Glu-Phe-Ile-Ala-Trp-Leu-
26  27
Val-Lys
    34

In this embodiment, trypsin will cleave the recombinant
multicopy polypeptide into recombinant single copy
polypeptides at the -Arg- residue at amino acids 6-7 and
at the -Lys- residue at amino acids 34-1 to yield single
copies of core GLP1 (7-34) and intraconnecting units of
(SEQ ID NO:5):

His-Asp-Glu-Phe-Glu-Arg-His
1                   6   7

The reaction will also yield a single terminal
Ala-Phe-Ala leaving group. When the reaction is
conducted in the presence of an appropriate nucleophilic
addition unit, such as Gly-Arg-NH2 or Gly-Arg-Gly,
transpeptidation occurs, yielding the modified
recombinant polypeptide products GLIP and GLP1
(7-36)-Gly, respectively.
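The fragment pattern described for the multicopy construct can be reproduced with a small digest simulation. This is a sketch under stated assumptions and not the disclosed procedure: the function name `trypsin_digest`, the layout of the construct as repeated intraconnecting-peptide-plus-core units, and the copy count of three are all choices made here for illustration. The cleavage rule is the one stated earlier in the text, namely cleavage after Lys or Arg except where Lys is followed by Glu or Asp.

```python
# Illustrative sketch: digest a hypothetical three-copy construct using the
# trypsin rule described in the text. Layout and copy count are assumptions.

ACIDIC = {"Glu", "Asp"}


def trypsin_digest(residues):
    """Split after each Lys or Arg, except a Lys followed by Glu or Asp."""
    fragments, current = [], []
    for i, residue in enumerate(residues):
        current.append(residue)
        if i == len(residues) - 1:
            break
        if residue in ("Lys", "Arg"):
            if not (residue == "Lys" and residues[i + 1] in ACIDIC):
                fragments.append(current)
                current = []
    fragments.append(current)
    return fragments


INTRA = "His-Asp-Glu-Phe-Glu-Arg".split("-")           # residues 1-6
CORE = ("His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-"
        "Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys").split("-")
LEAVING = ["Ala", "Phe", "Ala"]

multicopy = (INTRA + CORE) * 3 + LEAVING
fragments = trypsin_digest(multicopy)

cores = [f for f in fragments if f == CORE]
linkers = [f for f in fragments if f == INTRA]
print(len(cores), len(linkers), "-".join(fragments[-1]))
```

The digest returns three intact cores, three intraconnecting peptides and one Ala-Phe-Ala fragment, because the Lys at position 26 is followed by Glu and is skipped. The simulation models only where the chain is cut; in the presence of an addition unit each released core would instead be captured as the transpeptidated product.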

It is further recognized that the
interconnecting polypeptides may be cleaved before or
after transpeptidation by a chemical or enzymatic
cleavage agent. (See Table 2.)

According to the method of the invention, the
recombinant multicopy polypeptide is purified as
described in Scopes et al., Protein Purification:
Principles and Practice, Springer-Verlag, New York
(1987), which is incorporated herein by reference. The
purified multicopy recombinant polypeptide is then
further processed according to the method of the
invention. As previously discussed for the recombinant
single copy polypeptide, the reaction is conducted,
preferably, in a buffered solution at pH 5-11. As
described earlier, the amount of addition unit required
is in the range of one equivalent up to 20 x Km of the
enzyme to the addition unit. Trypsin is added in a
trypsin:polypeptide ratio of preferably about 1:200 to
1:50,000. Simultaneous cleavage of the recombinant
multicopy polypeptide and transpeptidation yields
multiple copies of SEQ ID NO:5, multiple copies of
recombinant GLP1 (7-34) core, and one Ala-Phe-Ala
leaving group. The Gly-Arg-NH2 or Gly-Arg-Gly addition
units act as a nucleophile and transpeptidation occurs
at amino acid residue 34. The production of the modified
recombinant polypeptide GLIP or GLP1 (7-36)-Gly product
is monitored by HPLC or other analytical technique and
the reaction is stopped by the addition of a suitable acid
as described above.
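
The simultaneous cleavage and transpeptidation described above can be sketched as simple string bookkeeping in Python. This is an illustration only, not the method of the invention: the one-letter sequences are transcribed from SEQ ID NO:4, the copy number of three is arbitrary, and the function names (`build_multicopy`, `cleave_at_junctions`, `transpeptidate`) are hypothetical. The sketch cuts only at the two junctions named in the text (after Arg 6 and after Lys 34) and assumes, for simplicity, that the internal Lys 26 is left intact; it does not model enzyme kinetics or the competing hydrolysis reaction.

```python
# One-letter sequences transcribed from SEQ ID NO:4 (assumed transcription).
INTRA = "HDEFER"                        # residues 1-6, ends in Arg 6
CORE = "HAEGTFTSDVSSYLEGQAAKEFIAWLVK"   # GLP1 (7-34), ends in Lys 34
LEAVING = "AFA"                         # Ala-Phe-Ala leaving unit


def build_multicopy(copies: int) -> str:
    """Join `copies` intraconnecting-peptide/core pairs and add the leaving unit."""
    return (INTRA + CORE) * copies + LEAVING


def cleave_at_junctions(polypeptide: str) -> list[str]:
    """Split only at the Arg 6/His 7 and Lys 34/next-unit junctions.

    The internal Lys 26 is not cut here; that is a simplifying assumption.
    """
    fragments = []
    rest = polypeptide
    unit = INTRA + CORE
    while rest.startswith(unit):
        fragments += [INTRA, CORE]
        rest = rest[len(unit):]
    if rest:
        fragments.append(rest)  # terminal leaving group
    return fragments


def transpeptidate(core: str, addition_unit: str) -> str:
    """Attach the nucleophilic addition unit at the C-terminal Lys 34."""
    return core + addition_unit


fragments = cleave_at_junctions(build_multicopy(3))
# Gly-Arg-NH2 as the addition unit gives the amidated (7-36) product.
products = [transpeptidate(f, "GR") + "-NH2" for f in fragments if f == CORE]
print(fragments.count(CORE), fragments.count(INTRA), fragments[-1])
print(products[0])
```

Each core copy gains the Gly-Arg addition unit at residue 34, while the intraconnecting units and the single Ala-Phe-Ala leaving group are released as by-products.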

In an alternative variation, the modified
recombinant polypeptide products may be formed, by the
method of the invention, through simultaneous cleavage
and transpeptidation of recombinant multicopy
polypeptide units composed of multiple single copy
polypeptide units connected without intervening
intraconnecting peptides. According to this variation
of the invention, for example, a recombinant multicopy
polypeptide composed of multiple single copy GLP1 (7-34)
cores, with a terminal GLP1 (7-34) core linked to an
Ala-Phe-Ala leaving unit, is expressed. The expressed
multicopy recombinant polypeptide is purified from cell
constituents, as previously described. The multicopy
construct is treated with the endopeptidase enzyme
trypsin, as previously described by the method of the
invention. Trypsin will cleave the multicopy
polypeptide into single copy polypeptides at the 34-7
-Lys- residue (see SEQ ID NO:3). Simultaneous with
cleavage, the single copy polypeptides will undergo
transpeptidation, as previously described, yielding the
GLIP or GLP1 (7-36)-Gly products in the presence of
Gly-Arg-NH2 or Gly-Arg-Gly addition units, respectively.

In another variation, it is further recognized
that the GLP1 (1-36)-NH2 and GLP1 (1-36)-Gly modified
recombinant products may be prepared, according to the
method of the invention, using mutant forms of trypsin.
In this variation, a multicopy recombinant polypeptide,
as previously described, is synthesized using multiple
single copy GLP1 (1-34) units (SEQ ID NO:6)

    His-Asp-Glu-Phe-Glu-Arg-His-Ala
     1                       7
    Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser
    Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala
    Lys-Glu-Phe-Ile-Ala-Trp-Leu
    26  27
    Val-Lys
        34

contiguously connected without an intraconnecting
peptide and with a leaving group only at the terminal
polypeptide. The mutant trypsin endopeptidase enzymes
used for this variation have a decreased rate of
cleavage at the -Arg- site and a normal rate of cleavage
at the -Lys- cleavage site. These mutant forms will,
therefore, cleave the recombinant polypeptides at the
-Lys- 34 residue, but will not cleave at the -Arg- 6
residue, yielding multiple single copy GLP1 (1-34) cores
and a single leaving group. In the presence of a suitable
addition unit such as Gly-Arg-NH2 or Gly-Arg-Gly, under
the conditions of the invention, the GLP1 (1-34) core
units will be transpeptidated, yielding the GLP1
(1-36)-NH2 or GLP1 (1-36)-Gly products.

The foregoing transpeptidation processes
described for a multicopy recombinant polypeptide may
alternatively be conducted in organic solvents conducive
to the transpeptidation process, as described earlier.

4. Trypsin Transpeptidation of a
Multicopy Recombinant Polypeptide
Derived from a Fusion Protein Construct

The modified recombinant polypeptide products 
may be produced, according to the method of the 
invention, by transpeptidation of recombinant single 
copy core polypeptide units which have been derived from 
a multicopy polypeptide unit which has been derived from 
a fusion protein construct. The number of recombinant 
single copy core polypeptides included within the 
recombinant multicopy polypeptide is limited only by the 
physical capabilities of the chosen expression system. 

The multicopy fusion protein construct is
formed, purified from the other cell constituents, and
the binding protein is separated from the recombinant
polypeptide, as described at Section II. The purified
recombinant multicopy polypeptide, separated from the
binding protein, is then further treated as described
above to yield the desired modified recombinant
polypeptide products.

B. Transpeptidation Using the Endopeptidase
Thrombin

Another example of an endopeptidase which may
act as a cleavage enzyme according to the method of the
invention is thrombin. As described earlier, thrombin
has a cleavage site preference at the carboxy end of
-Arg- (Y-Arg-X), within the known recognition sequence
Gly-Pro-Arg. A discovery of the present invention is
that thrombin also cleaves at the carboxy end of -Arg-
(Y-Arg-X) within the cleavage recognition sequence
Gly-Ala-Arg. The discovery of this recognition sequence
gives the endopeptidase enzyme thrombin new and
substantial utility in the preparation of modified
recombinant polypeptides by the method of this invention
and other recombinant methodologies. In the past,
recombinantly produced growth hormone releasing factor
(GRF) (1-44)-NH2 was produced through the use of an
α-amidating enzyme. By the method of the present
invention, the amidated form of GRF may be synthesized
through the use of an appropriate addition unit to a
core, or by amidation of an addition unit after
transpeptidation by the method of the invention.

1. Thrombin Transpeptidation of a
Single Copy Recombinant Polypeptide

The transpeptidation process of the present
invention, utilizing the endopeptidase enzyme thrombin,
is a one-step reaction. As discussed earlier for
trypsin, conditions are maintained so that, of the
competing reactions of hydrolysis and transpeptidation,
transpeptidation is favored. Within an aqueous
environment, the reaction is conducted in a buffer
solution capable of maintaining pH at about pH 2-13,
preferably pH 3-12, and more preferably pH 5-11.
Suitable buffers for the present invention are as
previously described for trypsin. Using the serine
endopeptidase thrombin as a cleavage enzyme for cleavage
and transpeptidation, the recombinant polypeptide
includes a GRF (1-41) core linked to a leaving unit. A
known leaving unit is Ala-Arg-Leu-Ala. It is recognized
that there are many potential leaving units, including
peptides and single amino acids such as -Ala-. The
sequence of GRF (1-41) (SEQ ID NO:7) is:

    Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys-
    Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln-
    13
    Asp-Ile-Met-Ser-Arg-Gln-Gln-Gly-Glu-Ser-Asn-Gln-
    25
    Glu-Arg-Gly-Ala-Arg
    37              41

A suitable addition unit for synthesis of GRF (1-44)-NH2
is Ala-Arg-Leu-NH2, and for synthesis of GRF (1-44)-Gly,
a suitable addition unit is Ala-Arg-Leu-Gly (SEQ ID
NO:8). The present variation uses the discovery that
thrombin recognizes the cleavage site -Arg- within a
Gly-Ala-Arg recognition sequence.
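
As a rough illustration of how the two recognition sequences locate the cleavage site, the following Python sketch scans GRF (1-41) linked to the Ala-Arg-Leu-Ala leaving unit for Gly-Pro-Arg or Gly-Ala-Arg and then swaps the leaving unit for an addition unit. The one-letter sequence is transcribed from SEQ ID NO:7; the function names and the regular-expression approach are assumptions made for this example, and the sketch treats sequence matching as the only criterion for cleavage.

```python
import re

# GRF (1-41), one-letter code transcribed from SEQ ID NO:7 (assumed transcription).
GRF_1_41 = "YADAIFTNSYRKVLGQLSARKLLQDIMSRQQGESNQERGAR"
LEAVING_UNIT = "ARLA"  # Ala-Arg-Leu-Ala

# Recognition sequences named in the text: Gly-Pro-Arg and Gly-Ala-Arg.
THROMBIN_SITE = re.compile(r"G[PA]R")


def thrombin_cut_positions(sequence: str) -> list[int]:
    """Return the 1-based position of each Arg at whose carboxy end cleavage is expected."""
    return [match.end() for match in THROMBIN_SITE.finditer(sequence)]


def transpeptidate(sequence: str, cut: int, addition_unit: str) -> str:
    """Replace everything after residue `cut` with the addition unit."""
    return sequence[:cut] + addition_unit


substrate = GRF_1_41 + LEAVING_UNIT
cuts = thrombin_cut_positions(substrate)
# Ala-Arg-Leu-NH2 as the addition unit gives GRF (1-44)-NH2.
product = transpeptidate(substrate, cuts[0], "ARL") + "-NH2"
print(cuts)
print(product)
```

In this substrate the only match is the Gly-Ala-Arg ending at residue 41, so the leaving unit is the only part of the sequence exchanged.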

This knowledge is used to cleave the
Ala-Arg-Leu-Ala leaving unit from the core at the -Arg-
within Gly-Ala-Arg. To the recombinant polypeptide GRF
(1-41)-Ala-Arg-Leu-Ala is added the addition unit at an
amount of about one equivalent to 20 times the Km of the
addition unit to the acyl-enzyme intermediate,
preferably 10 x Km of the addition unit to the acyl-
enzyme intermediate. For example, Ala-Arg-Leu-NH2 or
Ala-Arg-Leu-Gly (SEQ ID NO:8) are suitable addition
units for synthesis of the modified recombinant GRF
(1-44)-NH2 and GRF (1-44)-Gly products. The cleavage
enzyme thrombin is added to the mixture in a
thrombin:polypeptide ratio of about 1:10 to 1:500,000,
preferably 1:100 to 1:100,000, and more preferably 1:200
to 1:50,000.
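
The stated thrombin:polypeptide ranges lend themselves to a small calculation helper. The sketch below is a hedged example: the text does not say whether the ratio is by weight or by mole, so the functions simply work in whatever unit the caller supplies, and the names `enzyme_amount` and `classify_ratio` are hypothetical.

```python
# (label, smallest denominator, largest denominator) for 1:N ratios in the text,
# ordered from the narrowest range to the broadest.
RANGES = [
    ("more preferred", 200, 50_000),
    ("preferred", 100, 100_000),
    ("broad", 10, 500_000),
]


def enzyme_amount(polypeptide_amount: float, ratio_denominator: float) -> float:
    """Enzyme needed for a 1:`ratio_denominator` enzyme:polypeptide ratio.

    The result is in the same unit as `polypeptide_amount` (assumption:
    the caller decides whether that unit is mass or moles).
    """
    if ratio_denominator <= 0:
        raise ValueError("ratio denominator must be positive")
    return polypeptide_amount / ratio_denominator


def classify_ratio(ratio_denominator: float) -> str:
    """Name the narrowest stated range that contains the 1:N ratio."""
    for label, low, high in RANGES:
        if low <= ratio_denominator <= high:
            return label
    return "outside stated ranges"


print(enzyme_amount(100.0, 200))
print(classify_ratio(1_000))
```

For instance, 100 units of polypeptide at a 1:200 ratio call for 0.5 units of thrombin, which falls in the most preferred range.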

The production of GRF (1-44)-NH2 is monitored by
HPLC or other appropriate analytical technique and the
reaction is stopped by the addition of an enzyme inhibitor
such as phenyl methane sulfonyl fluoride (PMSF) or
diisopropyl phosphoryl fluoridate (DPF). The modified
recombinant polypeptide is separated from the reaction
mixture by reverse phase chromatography, hydrophobic
interaction chromatography, ion exchange chromatography,
or HPLC. The recombinant polypeptide product may be
stored at about -20°C to about -80°C after
lyophilization.

Alternatively, the thrombin catalyzed
competitive reactions of hydrolysis and
transpeptidation may be shifted in favor of
transpeptidation through the use of organic solvents.
Suitable solvents for the transpeptidation reaction,
according to the method of the invention, include DMSO
and 75% v:v N,N'-dimethylacetamide and 95% v:v. Bongers
et al., cited supra.

2. Thrombin Transpeptidation of a Single Copy
Recombinant Polypeptide Derived from a
Fusion Protein Construct

Recombinant GRF (1-44)-NH2 can be prepared,
according to the method of the invention, from a
recombinant polypeptide derived from a single copy
fusion protein construct. The expression of the single
copy fusion protein construct is described infra. In
brief, the binding protein of the fusion protein
construct will be connected to the single copy
recombinant polypeptide through an interconnecting
peptide. The interconnecting peptide may be a single
amino acid which is cleavable by a chemical cleavage
agent or a peptide which terminates with an amino acid
sequence recognizable by a cleavage enzyme. For
example, the tetrapeptide Asn-Gly-Pro-Arg (SEQ ID NO:9)
is a suitable interconnecting peptide for the fusion
protein construct containing the GRF (1-41)-Ala-Arg-Leu-
Ala single copy recombinant polypeptide.

Once expressed, the fusion protein construct is
purified from other cell constituents, and the single
copy recombinant polypeptide is then separated from the
binding protein, as described in copending U.S. patent
application Serial No. 07/552,810, the disclosure of
which is incorporated herein by reference. For example,
a human carbonic anhydrase fusion protein may be
separated from the interconnecting peptide of the
sequence Asn-Gly-Pro-Arg (SEQ ID NO:9) through the use
of guanidine hydrochloride.

The cleaved single copy recombinant GRF (1-41)-
Ala-Arg-Leu-Ala peptide can be separated from the
binding protein by normal chromatographic methods
including ion exchange, reverse phase, and size
exclusion. Alternatively, the recombinant single copy
polypeptide may be separated from the carrier protein by
standard precipitation methods. The purified single
copy recombinant GRF (1-41)-Ala-Arg-Leu-Ala is then
treated according to the method of the invention, as
previously described, in the presence of an
Ala-Arg-Leu-NH2 or Ala-Arg-Leu-Gly (SEQ ID NO:8) addition
unit to yield the modified recombinant products GRF
(1-44)-NH2 and GRF (1-44)-Gly.



3. Thrombin Transpeptidation of a
Multicopy Recombinant Polypeptide

In a third variation, according to a method of
the invention, thrombin may be used to simultaneously
cleave and transpeptidate a recombinant multicopy
polypeptide to form the desired modified recombinant
polypeptide product. In this variation, the recombinant
multicopy polypeptide is produced by methods discussed
in Section II. The multiple single copy recombinant GRF
(1-41) cores are linked together without the use of an
intraconnecting peptide. The terminal GRF (1-41) core
is linked to an -Ala- leaving unit. The GRF (41-1)
linkages are prefixed by the thrombin recognition
sequence -Gly-Ala-Arg- and cleavage occurs at the -Arg41-
carboxy group (see SEQ ID NO:7).

The recombinant multicopy polypeptide is
purified, as discussed in Section II. The number of single
copy polypeptides which may be linked within the
recombinant multicopy polypeptide is limited only by the
physical capabilities of the expression system. The
recombinant multicopy polypeptide is added to buffer
solution with thrombin and an addition unit, as
described above. Also as described above, the reaction
may be conducted in organic solvents to favor the
transpeptidation reaction.

In this variation, the thrombin recognition
site of -Gly39-Ala40-Arg41- is also utilized to facilitate
the cleavage of GRF (1-41)-Ala-Arg-Leu-Ala at the
-Arg41-Ala42-Arg43-Leu44-Ala45 linkage of the terminal
single copy recombinant polypeptide. In the presence of
a suitable nucleophile such as Ala-Arg-Leu-NH2 or
Ala-Arg-Leu-Gly (SEQ ID NO:8), the desired modified
recombinant polypeptide products are produced through
transpeptidation simultaneous with cleavage of the
multicopy recombinant polypeptide.



4. Thrombin Transpeptidation of a Multicopy
Recombinant Polypeptide Derived from a
Fusion Protein Construct

The modified recombinant multicopy polypeptide
products may be produced, according to the method of the
invention, by transpeptidation of recombinant single copy
core polypeptide units which have been derived from a
multicopy polypeptide unit which has been derived from a
fusion protein construct. The number of recombinant
single copy core polypeptides included within the
recombinant multicopy fusion protein construct is
limited only by the physical capabilities of the
expression system. Purification of the recombinant
multicopy polypeptide from the fusion protein
construct is as previously described. The purified
recombinant multicopy polypeptide is separated from the
fusion protein construct and then treated with thrombin
endopeptidase enzyme, as described above, to yield the
desired modified recombinant polypeptide product.

The multicopy fusion protein construct is
prepared as described in Section II.

II. Forming the Recombinant Single- or
Multicopy Polypeptide and the Single- or
Multicopy Recombinant Fusion Protein Construct

The recombinant single- or multicopy
polypeptide or the single- or multicopy recombinant
fusion protein construct is formed by recombinant DNA
methods disclosed in U.S. application Serial No.
07/552,810, the disclosure of which is incorporated
herein by reference. The gene sequence for the desired
recombinant polypeptide can be cloned or, in the case of
a smaller peptide, synthesized by automated synthesis.
The gene sequence encoding the leaving unit is linked at
the C-terminal end of the core polypeptide.

For conciseness, the term "fusion protein
construct" will be used to refer to either the single-
or multicopy recombinant fusion protein. The term
"polypeptide construct" will be used to generically
refer to the recombinant single- or multicopy
polypeptide.

The expression vector containing the
recombinant gene for a polypeptide construct or fusion
protein construct is capable of directing expression of
the recombinant gene in prokaryotic or eukaryotic cells.
The expression vector incorporates the recombinant gene
and base vector segments such as the appropriate
regulatory DNA sequences for transcription, translation,
phenotyping, temporal or other control of expression,
RNA binding and post-expression manipulation of the
expressed product. The expression vector generally will
include structural features such as a promoter, an
operator, a regulatory sequence and a transcription
termination signal. The expression vector can be
synthesized from any base vector that is compatible with
the host cell or higher organism and will provide the
foregoing features. The regulatory sequences of the
expression vector will be specifically compatible or
adapted in some fashion to be compatible with
prokaryotic or eukaryotic host cells or higher
organisms. Post-expression regulatory sequences, which
cause secretion of the polypeptide construct, can be
included in the eukaryotic expression vector. It is
especially preferred that the expression vector exhibit
a stimulatory effect upon the host cell or higher
organism such that the polypeptide construct is
overproduced relative to the usual biosynthetic
expression of the host.

Transformed prokaryotic or eukaryotic cells or
higher organisms carrying the appropriate recombinant
prokaryotic or eukaryotic vectors constitute the
transformed cells of this invention. The prokaryotic
cells useful as hosts include any that are amenable to
expression of foreign protein. Preferred embodiments
include E. coli and B. subtilis. The eukaryotic cells
include unicellular organisms, such as yeast cells, as
well as immortal cells from higher organisms, such as
plant, insect or mammalian cells. Preferred eukaryotic
cells include Saccharomyces cerevisiae, Pichia pastoris,
Aspergillus niger, Spodoptera frugiperda, and corn,
tobacco or soybean plant cells. The higher organisms
useful as hosts include higher order plants and animals
having germ cells that are amenable to transformation.
Included are plants such as tobacco, corn, soybean and
fruit bearing plants, and invertebrate and vertebrate
animals such as fish, birds and mammals, especially
including sheep, goats, cows, horses and pigs.

The invention also includes a cultured,
transformed cell or transformed plants or animals that
are capable of expressing the fusion protein or
polypeptide construct composed of a core and at least one
leaving unit, wherein the leaving unit is linked to the
core by an enzyme cleavage site and may be substituted
by an addition unit when transpeptidated by an
endopeptidase cleavage enzyme.

The expression steps of the method according to
the present invention are based upon microbial or higher
organism protein expression. The steps call for
inserting the recombinant gene into an appropriate base
vector, transforming host cells or higher organisms with
the resulting recombinant vector and expressing the
polypeptide construct or fusion protein construct,
preferably as a soluble product within the host cell or
higher organism, as a product that is insoluble in the
cell cytoplasm, or as a secreted product by the host
cell or higher organism. When higher organisms are
chosen as the host, fertilized germ cells of that
organism are transformed and the transformed organism is
grown through usual maturation techniques.

The purification steps of a polypeptide
construct call for separating the polypeptide construct
from other cellular constituents, debris, and culture
medium. The purification steps of a fusion protein
construct call for affinity binding of the fusion
protein construct to an immobilized ligand, and
separating it from other cellular constituents, debris
and culture medium. The polypeptide portion of the
fusion protein construct is obtained from the
immobilized fusion protein construct through enzymatic
or chemical cleavage action upon the interconnecting
peptide, and separating the variable fused polypeptide
from the cleavage enzyme or other material. (Throughout
this application, mention of enzymatic or chemical
cleavage alone will be understood to include both.)

Alternatively, the purification steps can
separate the entire fusion protein construct from the
immobilized ligand after purification and cleave it with
an immobilized cleavage enzyme or chemical reagent to
produce a mixture containing the variable fused
polypeptide and binding protein. This mixture can be
separated by use of an immobilized ligand for the
binding protein and removal of the purified polypeptide
construct.

Preferred embodiments of the method include
those expressing the recombinant gene composed of DNA
segments for human carbonic anhydrase (or a modified
functional version thereof), interconnecting peptide and
a recombinant single copy polypeptide or multiple units
thereof. Additional preferred embodiments include use
of E. coli or yeast as the host cells and use of
controlled expression by means of any induction system
such as temperature, nutrients, isopropyl
thiogalactoside, indole acrylic acid, carbon sources and
the like, so as to allow the production of a protein
purification construct that would be toxic to the host.
Further preferred embodiments include use of an
expression vector system for prokaryotic cells which
incorporates a two plasmid construction, and an
expression vector system for yeast cells which
incorporates a shuttle vector with an origin of
replication for E. coli and one for S. cerevisiae.

Due to E. coli digestion of single copy
recombinant polypeptide expressed intracellularly,
incorporation into a fusion protein construct is
required. The possibility of an E. coli organism which
does not degrade intracellularly expressed single copy
recombinant polypeptides not attached to a carrier
protein is recognized according to the method of the
invention.



A. Recombinant Polypeptide Production 
from a Fusion Protein Construct 

1. Method for Expression of Host Cells 
of Fusion Protein Construct 

The methods for expression of single- and
multicopy recombinant fusion protein products are disclosed
in U.S. patent application Serial No. 07/552,810, filed
July 16, 1990, the disclosure of which is incorporated
herein by reference.

As discussed in U.S. Serial No. 07/552,810, the
use of multicopy or single copy recombinant fusion
proteins allows for the highly efficient purification of
recombinant polypeptides. The construct of a
recombinant fusion protein has three tandem segments.
The first segment is a binding protein which exhibits
strong, reversible binding to a specific small molecular
weight ligand. The second segment is an interconnecting
peptide which is selectively cleavable by an enzyme or
chemical technique. The interconnecting peptide
connects the binding protein to the N- or C-terminal end
of the recombinant single copy or multicopy polypeptide.
It is typically a short chain peptide. It is preferred
to construct the fusion protein construct gene so that
the binding protein gene fragment is read first. The
third segment, the variable fused polypeptide,
incorporates any natural or synthetic polypeptide
desired as a starting product for the method of the
invention.
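
The three-segment layout can be pictured with a small data structure. The following Python sketch is only a schematic, assuming the preferred order in which the binding protein is read first; the class name, field names, and the placeholder binding-protein string are hypothetical, and the example sequences (Asn-Gly-Pro-Arg as the interconnecting peptide and GRF (1-41)-Ala-Arg-Leu-Ala as the fused polypeptide) are taken from the thrombin discussion earlier in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FusionProteinConstruct:
    binding_protein: str          # segment 1: binds a small molecular weight ligand
    interconnecting_peptide: str  # segment 2: selectively cleavable linker
    fused_polypeptide: str        # segment 3: single copy or multicopy polypeptide

    def sequence(self) -> str:
        """Full construct with the binding protein read first."""
        return (self.binding_protein
                + self.interconnecting_peptide
                + self.fused_polypeptide)

    def release_polypeptide(self) -> str:
        """Model cleavage at the C-terminal end of the interconnecting peptide."""
        prefix = len(self.binding_protein) + len(self.interconnecting_peptide)
        return self.sequence()[prefix:]


construct = FusionProteinConstruct(
    binding_protein="MBINDER",        # placeholder, not a real carbonic anhydrase sequence
    interconnecting_peptide="NGPR",   # Asn-Gly-Pro-Arg (SEQ ID NO:9)
    fused_polypeptide="YADAIFTNSYRKVLGQLSARKLLQDIMSRQQGESNQERGAR" + "ARLA",
)
print(construct.release_polypeptide())
```

Keeping the three segments as separate fields makes it explicit that cleavage at the interconnecting peptide releases the variable fused polypeptide unchanged.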

2. Method of Purification of 
Fusion Protein Construct 

The recombinant single or multicopy polypeptide
produced as a fusion protein allows for easy
purification by affinity chromatography. The fusion
protein produced in the transformed cells can be soluble
in the cells or insoluble in inclusion bodies. Soluble
fusion protein construct is obtained by lysis of the
transformed cells to form a crude cell lysate. The
crude cell lysate can be further purified by methods
including ultrafiltration and ion exchange
chromatography before purification by affinity
chromatography. Insoluble fusion protein in inclusion
bodies is also purified by similar methods.

To perform affinity purification, the crude
mixture of materials is combined with an immobilized
ligand for the binding protein. Examples of the binding
protein, corresponding ligand and dissociation constants
are given in Table 1. A complete discussion of the
method of purification of the fusion protein construct
is found in copending application Serial No. 07/552,810,
the disclosure of which is incorporated herein by
reference.



TABLE 1

Binding Protein                      Ligand                                  Kd
Xanthine oxidase                     Allopurinol                             strong
Adenosine deaminase                  Coformycin                              <1.2E-10
Adenosine deaminase                  Deoxycoformycin                         2.5E-12
Adenosine deaminase                  erythro-9-(2-hydroxy-3-nonyl)adenine    1.6E-9
Dihydrofolate reductase              Methotrexate                            1.2E-9
Dihydrofolate reductase              Methotrexate                            2.3E-9
Dihydrofolate reductase              Aminopterin                             3.7E-9
Dihydrofolate reductase              Trimethoprim                            4.6E-9
Ribulose bisphosphate carboxylase    2-carboxyarabinitol 1,5-bisphosphate    1E-14
Pepsin                               Pepstatin                               10E-9
Calmodulin                           Melittin                                3E-9
Calmodulin                           Various peptides                        0.2E-9
Cholesterol esterase                 Borinic acid                            0.1E-9
Carbonic anhydrase II                Sulfanilamide                           4.6E-7
Carbonic anhydrase II                Acetazolamide                           6E-10

E is times ten to the negative exponent indicated.

For the preferred carbonic anhydrase enzyme,
the ligand is sulfanilamide or a benzene sulfonamide
derivative. Immobilization of the ligand on a solid
support can be accomplished by the methods of W.
Scouter, Methods Enzymol., 34, 288-294 (1974); S.
Marcus, Methods Enzymol., 34, 377-385 (1974); A. Matsura
et al., Methods Enzymol., 34, 303-4 (1974); R. Barker,
Methods Enzymol., 34, 317-328 (1974); I. Matsumoto,
Methods Enzymol., 34, 324-341 (1974); J. Johansen,
Carlsberg Res. Commun., 14, 73 (1976) and G. S. Bethell
et al., J. Biol. Chem., 254, 2572-2574 (1979); the
disclosures of which are incorporated herein by
reference. The fusion protein binds to the immobilized
ligand through the reversible affinity of the binding
protein for its ligand. The remaining constituents and
debris of the crude mixture of materials can then be
removed by washing or similar techniques.
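
The E notation used for the dissociation constants in Table 1 maps directly onto floating-point literals, which makes the entries easy to compare numerically. The helper below is a minimal sketch; the function name `parse_kd` and the returned tuple layout are assumptions for this example, and the table itself does not state the unit of Kd.

```python
def parse_kd(entry: str):
    """Convert a Table 1 Kd entry to (is_upper_bound, value).

    '<1.2E-10' -> (True, 1.2e-10); 'strong' -> (False, None).
    The unit is whatever Table 1 uses (not stated in the text).
    """
    text = entry.replace(" ", "")
    is_upper_bound = text.startswith("<")
    if is_upper_bound:
        text = text[1:]
    try:
        return is_upper_bound, float(text)
    except ValueError:
        # Qualitative entries such as "strong" carry no number.
        return is_upper_bound, None


# Rank the two carbonic anhydrase II ligands by Kd (smaller means tighter binding).
ligands = {"Sulfanilamide": "4.6E-7", "Acetazolamide": "6E-10"}
ranked = sorted(ligands, key=lambda name: parse_kd(ligands[name])[1])
print(ranked)
```

A smaller Kd indicates tighter binding, so sorting on the parsed value orders ligands from strongest to weakest affinity.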

Two routes can be employed for further
purification of the fusion protein. According to the
first route, the single or multicopy fusion protein is
dissociated intact from the immobilized ligand by
washing with a strong competing ligand solution.
Examples include cyanides, pseudocyanides such as
thiocyanides, perchlorates, halide and similar strong
Lewis bases.

According to the second route, the immobilized
single or multicopy fusion protein is contacted directly
with cleavage reagent to release the single or multicopy
polypeptide. To isolate the single or multicopy
polypeptide in the second route, its mixture with
cleavage enzyme can be combined with a means for
molecular weight selection (e.g. partition
chromatography, dialysis, filtration based on molecular
size or high pressure liquid chromatography on a
"particle exclusion" base or ion exchange
chromatography) such that the high molecular weight
cleavage enzyme is separated from the free variable
fused peptide. Or, the mixture can be combined with an
immobilized affinity material for the cleavage enzyme.

The cleavage enzyme chosen will depend upon the 
interconnecting peptide chosen. Examples of cleavage 
enzymes and their cleavage sites are given in Table 2. 



TABLE 2

Enzymes for Cleavage             DNA Seq.
Enterokinase                     GACGACGACGATAAA (SEQ ID NO:10)
Factor Xa                        ATTGAAGGAAGA (SEQ ID NO:11)
Thrombin                         AGAGGACCAAGA (SEQ ID NO:12)
Ubiquitin Cleaving Enzyme        AGAGGAGGA (SEQ ID NO:13)
Renin                            CATCCTTTTCATCTGCTGGTTTAT (SEQ ID NO:14)
Trypsin                          AAA or CGT
Chymotrypsin                     TTT or TAT or TGG
Clostripain                      CGT
S. aureus V8                     GAA

Chemical Cleavage                DNA Seq.
(at pH3)                         GATGGA
(Hydroxylamine)                  AATCCA
(CNBr)                           ATG
BNPS-skatole                     TGG
2-Nitro-5-thiocyanobenzoate      TGT
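
The DNA sequences listed for the cleavage enzymes in Table 2 can be related to the peptide sites they encode by translating them with the standard genetic code. The Python sketch below does this for the enzyme entries; the function name `translate` is hypothetical, and the use of the standard genetic code for every intended host is an assumption of this example.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence into hyphenated three-letter codes."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    codons = (dna[i:i + 3] for i in range(0, len(dna), 3))
    return "-".join(THREE_LETTER[CODON_TABLE[codon]] for codon in codons)


for enzyme, dna in [
    ("Enterokinase", "GACGACGACGATAAA"),
    ("Factor Xa", "ATTGAAGGAAGA"),
    ("Thrombin", "AGAGGACCAAGA"),
]:
    print(enzyme, translate(dna))
```

The thrombin entry translates to Arg-Gly-Pro-Arg, which contains the Gly-Pro-Arg recognition sequence discussed earlier in the text.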

The purification methods described above yield
the starting materials for the method of the invention:
a single copy recombinant fusion protein, a multicopy
recombinant fusion protein, a single copy recombinant
polypeptide, or a multicopy recombinant polypeptide. In
a preferred embodiment, the recombinant single and
multicopy polypeptides are produced from a fusion
protein.

B. Recombinant Polypeptide Production from 
a Recombinant Polypeptide 

1. Recombinant Method for Expression of
Host Cells of Multicopy Polypeptide

The methods for expression of single- and
multicopy recombinant polypeptide, i.e., a polypeptide
expressed with a leader sequence, a limiting protein or
an affinity moiety attached to it, are known in the art
and described in Protein Purification: From Mechanisms
to Large-scale Processes, Michael Ladisch, editor;
American Chemical Society, publisher (1990), the
disclosure of which is incorporated herein by reference.

2. Method of Purification of 
Recombinant Multicopy Polypeptide 

The method for purification of a recombinant
multicopy polypeptide is known in the art and is
described in Kirshner et al., J. Biotechnology, 12:247-
260 (1989), the disclosure of which is incorporated
herein by reference.

III. Therapeutic Use of Recombinant Modified Polypeptide
Products Produced by the Method of the Invention

The products of the present invention have
significant therapeutic and supplemental physiological
uses in clinical human and veterinary medical practice.
For example, the insulinotropic activity of GLP1 (7-
36)-NH2 has been shown to be beneficial in treating the
symptoms of non-insulin dependent diabetes mellitus
(NIDDM, Type II). Gutniak, New Eng. J. Med., 326:1316-2
(1992). GRF (1-44)-NH2 is of therapeutic benefit for
diseases such as short stature syndrome, endometriosis,
and osteoporosis. In addition, supplemental GRF has
been used to increase the lean to fat ratio in livestock,
allowing production of more wholesome meat products.

Methods of preparation of pharmaceutically
functional compositions of the products of the
invention, in combination with a physiologically
acceptable carrier, are known in the art. A functional
pharmaceutical composition must be administered in an
effective amount, by known routes of administration, for
which the dosage is dependent on the purpose of use and
the condition of the recipient.



EXAMPLE 1 

Preparation of Amidated
Recombinantly Produced GLP1 (7-36)-NH2
From a Single Copy Fusion Protein Construct

The naturally occurring sequence of Glucagon
Like Peptide 1 (GLP1) (SEQ ID NO:15) is:

    His-Asp-Glu-Phe-Glu-Arg-His-Ala
     1                       7
    Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser
    Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala
    Lys-Glu-Phe-Ile-Ala-Trp-Leu
    26  27
    Val-Lys-Gly-Arg-NH2
        34      36

A GLP1 peptide is a 36 amino acid peptide that
has been recombinantly produced but without a mechanism
for providing for the amidation of the C-terminal
arginine residue. In this example, the method of the
invention has been designed to produce a single copy
fusion protein construct containing one copy of a gene
encoding a truncated core GLP1 and amidating the core
GLP1 by a transpeptidation reaction, using the
endopeptidase trypsin, to form a modified recombinant
GLP1 polypeptide.

The strategy involves forming a DNA construct
encoding a single copy recombinant fusion protein. The
single copy fusion protein includes at least three
segments. The first segment is a binding protein which
exhibits strong reversible binding to a specific small
molecular weight ligand. The second segment is an
interconnecting peptide which is selectively cleavable
by an enzyme or chemical technique. The third segment
is a variable fused peptide containing one copy of the
desired natural or synthetic polypeptide, in this case
GLP1 (7-34). The formation of a DNA construct for the
fusion protein, as well as the fusion protein itself,
has been described in copending U.S. Application Serial
No. 07/552,810 filed July 16, 1990, which is hereby
incorporated by reference.

The single copy fusion protein can be formed
with human carbonic anhydrase modified at residue 240 as
the binding protein. The modification of carbonic
anhydrase at residue 240 involves a substitution of a
leucine for a methionine. The interconnecting peptide
is a methionine residue which can be cleaved by cyanogen
bromide. The variable fused polypeptide contains a
single copy of a modified truncated GLP1 peptide having
the following sequence (SEQ ID NO:2):

His-Ala-Glu-Gly-Thr-Phe-Thr-
7
Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-
Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-
                26  27
Ala-Trp-Leu-Val-Lys-Ala-Phe-Ala
                34          37

The core GLP1 peptide is truncated from the
native sequence so that it contains residues
corresponding to residues 7-34 of the naturally
occurring sequence. The GLP1 peptide is modified by the
linkage of an Ala-Phe-Ala leaving unit at residues
35-37. This tripeptide is not found in the naturally
occurring sequence and is a good leaving group for
trypsin transpeptidation. Briefly, this single copy
recombinant fusion protein can be produced from a DNA
construct formed as follows. The DNA sequence from the
human carbonic anhydrase II gene is modified so that the
methionine codon at amino acid residue 240 is replaced
with a leucine codon using site-directed mutagenesis, as
described in Sambrook et al., Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory, N.Y.
(1989). The modified gene for human carbonic anhydrase
is then cloned into an expression vector which is
compatible with E. coli, such as pB0304, as described in
U.S. application Serial No. 07/552,810. In a preferred
but non-essential embodiment, a short DNA fragment
including the codon for methionine is chemically
synthesized and inserted immediately downstream from the
end of the gene for human carbonic anhydrase by standard
methods. A DNA sequence encoding the truncated core
GLP1 (7-34)-Ala-Phe-Ala polypeptide is formed by
automated DNA synthesis and inserted directly downstream
from the interconnecting DNA segment encoding the
methionine codon. The final recombinant expression
vector encoding the single copy fusion protein is
transformed into E. coli by standard methods and the
expressed recombinant single copy fusion protein can be
obtained using affinity chromatography with
sulfanilamide or by other chromatographic methods. Once
the recombinant fusion protein is purified, it can be
cleaved and transpeptidated.
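The truncation and leaving-unit linkage described above can be illustrated at the sequence level. The following is only a minimal sketch, assuming three-letter residue codes held in a Python list; the helper names (`residues`, `build_fused_peptide`) are hypothetical and are not part of the described method. It shows that taking residues 7-34 of the native GLP1 sequence (SEQ ID NO:15) and appending Ala-Phe-Ala gives a 31-residue peptide, consistent with the length listed for SEQ ID NO:2.

```python
# Native GLP1 (1-36) in three-letter codes, as listed in SEQ ID NO:15.
NATIVE_GLP1 = (
    "His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val "
    "Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu "
    "Val Lys Gly Arg"
).split()

# Leaving unit linked at positions 35-37 of the modified peptide.
LEAVING_UNIT = ["Ala", "Phe", "Ala"]


def residues(seq, first, last):
    """Return residues first..last (1-based, inclusive) of seq."""
    return seq[first - 1:last]


def build_fused_peptide(native, first, last, leaving_unit):
    """Truncate the native sequence to the core and append the leaving unit.

    Hypothetical helper used for illustration only.
    """
    return residues(native, first, last) + list(leaving_unit)


fused = build_fused_peptide(NATIVE_GLP1, 7, 34, LEAVING_UNIT)
print(len(fused))        # 28 core residues + 3 leaving-unit residues = 31
print("-".join(fused))
```

The sketch works purely on residue names; it says nothing about the DNA construct, codon choice, or expression.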

Cleavage and transpeptidation can be conducted
as follows. For example, a 40 mg/ml solution of
HCA-Met-GLP1 (7-34)-Ala-Phe-Ala can be digested with a
50-fold excess of cyanogen bromide (CNBr) over methionine in
70% formic acid to release the GLP1 (7-34)-Ala-Phe-Ala
peptide. The reaction mixture can be incubated in the
dark under oxygen-free nitrogen at 20°-25°C for 16-24
hours. The reaction mixture is diluted with 15 volumes
of water and freeze dried. For the complete removal of
acid and by-products, the freeze drying can be repeated
after further addition of water. This cleavage reaction
yields human carbonic anhydrase and the recombinant GLP1
(7-34)-Ala-Phe-Ala polypeptide.

The cleaved GLP1 (7-34)-Ala-Phe-Ala polypeptide
can be separated from human carbonic anhydrase by normal
chromatographic methods, e.g., ion exchange, reverse
phase, or size exclusion. In addition, the cleaved
GLP1 (7-34)-Ala-Phe-Ala polypeptide can be separated
from the human carbonic anhydrase by a simple
precipitation procedure. A solution containing carbonic
anhydrase, 70% formic acid, cyanogen bromide,
methionine, and peptide is diluted with water to a
protein concentration of 20 mg/ml while maintaining an
acetic acid concentration of 10%. The addition of 5.6
g/100 ml of sodium sulfate (Na2SO4) to this mixture
results in a precipitate which can be removed by
centrifugation at 10,000 x g for 10 minutes. The
carbonic anhydrase can be quantitatively precipitated
and greater than 80% of the peptide remains in solution.
The supernatant can be applied to an open C-8 column
which is rinsed with four column volumes of 10% acetic
acid. The GLP1 (7-34)-Ala-Phe-Ala can be eluted from
the column with 50% acetonitrile in 10% acetic acid.
The peptide can then be freeze dried.
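The quantities in the precipitation procedure follow from simple proportional arithmetic. The sketch below is only illustrative: it assumes the mixture starts at the 40 mg/ml protein concentration used in the cyanogen bromide digestion, uses a hypothetical 50 ml batch, and ignores how the acid concentration is adjusted. The function names are assumptions introduced here.

```python
def diluent_volume_ml(initial_volume_ml, initial_conc, target_conc):
    """Volume of diluent needed to lower a concentration (C1*V1 = C2*V2)."""
    if target_conc <= 0 or target_conc > initial_conc:
        raise ValueError("target concentration must be in (0, initial_conc]")
    final_volume_ml = initial_volume_ml * initial_conc / target_conc
    return final_volume_ml - initial_volume_ml


def sodium_sulfate_mass_g(volume_ml, grams_per_100_ml=5.6):
    """Mass of sodium sulfate for a given volume at 5.6 g per 100 ml."""
    return volume_ml * grams_per_100_ml / 100.0


start_volume_ml = 50.0  # hypothetical batch size, not taken from the text
added_ml = diluent_volume_ml(start_volume_ml, initial_conc=40.0, target_conc=20.0)
final_volume_ml = start_volume_ml + added_ml
salt_g = sodium_sulfate_mass_g(final_volume_ml)

print(f"diluent to add: {added_ml:.1f} ml")
print(f"final volume:   {final_volume_ml:.1f} ml")
print(f"sodium sulfate: {salt_g:.2f} g")
```

Halving the protein concentration doubles the volume, so the sodium sulfate mass scales with the diluted volume rather than the starting volume.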

Once purified, the GLP1 (7-34)-Ala-Phe-Ala can
be transpeptidated to yield the modified recombinant
native GLP1 (7-36)-NH2 amino acid product as follows.

The recombinant GLP1 (7-34)-Ala-Phe-Ala
polypeptide can be cleaved with trypsin at the cleavage
site between amino acid residues 34 and 35 at the
Lys-Ala bond in the recombinant truncated polypeptide.
Trypsin did not cleave the Lys-Glu bond of residues 26
and 27 in experiments conducted on the recombinant GLP1
polypeptide as shown in SEQ ID NO:2. While not in any
way meant to limit the invention, it is believed that
cleavage at residues 26 and 27 by trypsin is not favored
because of the presence of the acidic glutamic acid
residue.
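The two lysine positions discussed above can be located mechanically. The sketch below is a simplified illustration, assuming the common rule of thumb that trypsin cleaves after Lys or Arg unless the next residue is Pro; that rule and the function name `candidate_tryptic_bonds` are assumptions introduced here. A sequence-only rule flags both Lys26-Glu27 and Lys34-Ala35, whereas the text reports that only the Lys34-Ala35 bond was cleaved in the experiments.

```python
# GLP1 (7-34)-Ala-Phe-Ala in three-letter codes (SEQ ID NO:2).
FUSED = (
    "His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly "
    "Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Ala Phe Ala"
).split()

FIRST_RESIDUE_NUMBER = 7  # native numbering of the first residue


def candidate_tryptic_bonds(seq, first_number=1):
    """List (position, residue, next_residue) for each Lys-X or Arg-X bond.

    Simplified rule of thumb (an assumption, not taken from the text):
    bonds followed by Pro are skipped.
    """
    bonds = []
    for i in range(len(seq) - 1):
        if seq[i] in ("Lys", "Arg") and seq[i + 1] != "Pro":
            bonds.append((first_number + i, seq[i], seq[i + 1]))
    return bonds


bonds = candidate_tryptic_bonds(FUSED, FIRST_RESIDUE_NUMBER)
for position, residue, next_residue in bonds:
    print(f"{residue}{position}-{next_residue}{position + 1}")
```

The difference between the two candidates is the neighbouring residue: glutamic acid follows Lys26, which is the explanation the text offers for the observed selectivity.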

The cleavage with trypsin is conducted in the
presence of either Gly-Arg-NH2 or Gly-Arg-Gly addition
units so that the cleavage of the Ala-Phe-Ala leaving
unit is followed by the addition of Gly-Arg-NH2 or
Gly-Arg-Gly to the core GLP1 (7-34) polypeptide to yield
either amidated GLP1 (7-36)-NH2 polypeptide or GLP1 (7-36)
peptide with a terminal glycine.

For example, the freeze dried GLP1 (7-34)-Ala-Phe-Ala
is dissolved at 10 mg/ml in a buffer at
pH 5-11 with 0.01 to 1 M Gly-Arg-NH2 or Gly-Arg-Gly
addition unit which contains bovine trypsin at a 1:1000
ratio (trypsin:peptide) at 37°C. The mixture is
stirred using a magnetic stirrer at 1000 rpm. The
trypsin cleaves the Ala-Phe-Ala from the carboxy
terminus of the core and forms an acyl-enzyme
intermediate with residue 34 of the core. The Gly-Arg-NH2
or Gly-Arg-Gly acts as a nucleophile favoring
transpeptidation of the acyl-enzyme intermediate. The
first reaction is:

                                         Trypsin
GLP1 (7-34)-Ala-Phe-Ala + Gly-Arg-NH2  ---------->  GLP1 (7-36)-NH2
                                        pH = 5-11

The second reaction is:

                                         Trypsin
GLP1 (7-34)-Ala-Phe-Ala + Gly-Arg-Gly  ---------->  GLP1 (7-36)-Gly
                                        pH = 5-11
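The net effect of the two reactions is an exchange of the C-terminal leaving unit for the addition unit. As a rough sequence-level sketch (it models neither the enzyme nor the acyl-enzyme intermediate, and the function name `transpeptidate` is hypothetical), the exchange can be written as a list operation and checked against residues 7-36 of the native sequence:

```python
# Native GLP1 (1-36) in three-letter codes (SEQ ID NO:15).
NATIVE_GLP1 = (
    "His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val "
    "Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu "
    "Val Lys Gly Arg"
).split()

LEAVING_UNIT = ["Ala", "Phe", "Ala"]
# Substrate: core GLP1 (7-34) followed by the leaving unit.
SUBSTRATE = NATIVE_GLP1[6:34] + LEAVING_UNIT


def transpeptidate(substrate, leaving_unit, addition_unit):
    """Replace a C-terminal leaving unit with an addition unit.

    Illustrative bookkeeping only; the chemistry is not modelled.
    """
    n = len(leaving_unit)
    if substrate[-n:] != list(leaving_unit):
        raise ValueError("substrate does not end with the leaving unit")
    return substrate[:-n] + list(addition_unit)


# First reaction: Gly-Arg-NH2 as addition unit (amide shown as a suffix).
amide_product = transpeptidate(SUBSTRATE, LEAVING_UNIT, ["Gly", "Arg"])
print("-".join(amide_product) + "-NH2")

# Second reaction: Gly-Arg-Gly as addition unit.
gly_product = transpeptidate(SUBSTRATE, LEAVING_UNIT, ["Gly", "Arg", "Gly"])
print("-".join(gly_product))
```

In this representation the first product has the same residue order as native GLP1 (7-36), and the second carries one extra C-terminal glycine.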

The production of GLP1 (7-36)-NH2 or GLP1 (7-36)-Gly
is monitored by HPLC and the reaction stopped by the
addition of 2 M HCl until the pH is below 3. As
described by Bongers et al., Int. J. Peptide Protein
Res., 40:268 (1992), the GLP1 (7-36)-Gly can be
converted to an amide in a later reaction by use of the
C-terminal α-amidating enzyme as described in Ohsuye et
al., cited supra.

EXAMPLE 2

Preparation of Amidated Recombinant GLP1 (7-36)-NH2
From a Multicopy Fusion Protein Construct

Amidated recombinant GLP1 (7-36)-NH2 was prepared
from a multicopy fusion protein containing four copies
of a modified truncated GLP1 peptide having amino acid
residues 1-34 of the native or naturally occurring
polypeptide and the terminal amino acid residues of
Ala-Phe-Ala at residues 35-37.
A DNA construct encoding a multicopy fusion
protein can be prepared as described in Example 1.
Briefly, in a preferred but non-essential embodiment, a gene
encoding human carbonic anhydrase is modified so that the
codon for methionine at amino acid residue 240 is
replaced with the codon for leucine and subcloned into a
vector that can be expressed in E. coli, such as pB0304,
as described in U.S. application Serial No. 07/552,810.
The DNA sequence for the interconnecting peptide
encoding a methionine residue and the DNA sequence encoding
four copies of the prefixed recombinant GLP1
(1-34)-Ala-Phe-Ala polypeptide can be synthesized by
automated DNA synthesis with the methionine codon 5' to
the DNA sequence encoding the truncated modified GLP1
sequence. This DNA sequence is then inserted
immediately downstream from the gene for human carbonic
anhydrase in the E. coli expression vector by standard
methods. The expression vector encoding the multicopy
fusion protein is then transformed into E. coli.
Transformants are selected and amplified. The multicopy
fusion protein is recovered and purified from cell
lysates as described in Example 1.

Once purified, the multicopy fusion protein is
cleaved with cyanogen bromide as described in Example 1
to yield human carbonic anhydrase and a multicopy
protein containing four copies of the truncated
GLP1 (1-34)-Ala-Phe-Ala polypeptide. The multicopy
peptide can be separated from human carbonic anhydrase
by standard chromatographic methods such as ion
exchange, reverse phase or size exclusion or by the
precipitation method described in Example 1.

The multicopy polypeptide is then cleaved and
transpeptidated with trypsin as follows. Trypsin will
cleave the multicopy polypeptide into single copy
polypeptides between amino acid residues 6 and 7 and
between residues 34 and 35 to yield four single copies of
GLP1 (7-34) and peptides containing Ala-Phe-Ala connected to
amino acid residues 1-6. When the cleavage is conducted
in the presence of an appropriate nucleophilic addition
unit, such as Gly-Arg-NH2, transpeptidation occurs. For
example, freeze dried multicopy polypeptide is dissolved
at 10 mg/ml in a buffer at pH 5-11 with 0.01 to 1 M
Gly-Arg-NH2 which contains trypsin at a 1:1000 ratio
(trypsin:peptide). The trypsin cleaves the multicopy
peptide as described above to yield GLP1 (7-34) core
polypeptide which forms an acyl-enzyme intermediate with
the trypsin. The Gly-Arg-NH2 acts as a nucleophile and
transpeptidation occurs at amino acid residue 34. The
production of GLP1 (7-36)-NH2 is monitored by HPLC and
the reaction stopped by the addition of HCl when the
reaction has reached completion.
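The fragment pattern expected from the multicopy digestion can be worked out by hand or with a short script. The sketch below is an illustration under stated assumptions: the cut positions (after residue 6 and after residue 34 of each copy) are supplied explicitly, following the text, rather than predicted, and the `digest` helper is hypothetical.

```python
# GLP1 (1-34) in three-letter codes (SEQ ID NO:6).
NATIVE_1_34 = (
    "His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val "
    "Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu "
    "Val Lys"
).split()

LEAVING_UNIT = ["Ala", "Phe", "Ala"]
COPY = NATIVE_1_34 + LEAVING_UNIT      # one GLP1 (1-34)-Ala-Phe-Ala unit
MULTICOPY = COPY * 4                   # four copies joined end to end


def digest(seq, copy_length, cut_after):
    """Cut seq after the given 1-based positions within each repeated copy."""
    fragments, current = [], []
    for i, residue in enumerate(seq):
        current.append(residue)
        if (i % copy_length) + 1 in cut_after:
            fragments.append(current)
            current = []
    if current:
        fragments.append(current)
    return fragments


fragments = digest(MULTICOPY, len(COPY), cut_after={6, 34})
core = NATIVE_1_34[6:]                 # GLP1 (7-34)
n_cores = sum(1 for fragment in fragments if fragment == core)

print(f"core copies released: {n_cores}")
for fragment in fragments:
    if fragment != core:
        print("-".join(fragment))
```

Besides the core copies, the digest leaves a leading 1-6 peptide, internal peptides in which Ala-Phe-Ala is joined to residues 1-6 of the next copy, and a trailing Ala-Phe-Ala.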

EXAMPLE 3 

Preparation of Amidated Recombinantly Produced
GLP1 (7-36)-NH2 From a Multicopy Polypeptide

Modified recombinant GLP1 (7-36)-NH2 can also be
prepared by cleavage and transpeptidation of a multicopy
polypeptide. The multicopy polypeptide was formed with
four copies of core GLP1 (7-34) connected to a terminal
core GLP1 (7-34) linked to an Ala-Phe-Ala leaving unit.

A DNA construct encoding the recombinant
multicopy polypeptide can be formed as described for a
multicopy or single copy recombinant fusion protein in
Examples 1 and 2, but without the carbonic
anhydrase as fusion protein or the methionine codon as
interconnecting peptide. A DNA sequence encoding four
copies of the GLP1 (7-34) core polypeptide and a
terminal GLP1 (7-34)-Ala-Phe-Ala recombinant
polypeptide can be synthesized by automated DNA
synthesis. The DNA sequence is then subcloned into an
expression vector compatible with E. coli and
transformed into E. coli. Transformants expressing the
recombinant multicopy polypeptides are selected and
amplified. It is likely that the recombinant multicopy
polypeptide will be found in inclusion bodies. The
recombinant multicopy polypeptide can be purified from
inclusion bodies as follows.

Cells are lysed with sonication in 50 ml
Tris-HCl (pH 7.9) and 2.5 ml EDTA containing 100 mM NaCl
with 10 micrograms of DNase I. Lysozyme (30 ml) is
added and the lysate is incubated overnight to disrupt
the cell fragments. To purify recombinant polypeptide
from insoluble granules, the lysate is then centrifuged
and the insoluble granules are incubated with sodium
deoxycholate, and washed several times. The inclusion
bodies are then frozen. The thawed inclusion bodies are
further purified by ultrafiltration and DEAE
chromatography after being dissolved in an appropriate
chaotropic reagent, such as urea, guanidine, or 50 mM
NaOH.

Once purified, the recombinant multicopy
polypeptide is cleaved and transpeptidated with trypsin.
Trypsin will cleave at the -Lys- at residue 34 to yield
single copies of the core GLP1 (7-34) and a copy of
GLP1 (7-34)-Ala-Phe-Ala. The GLP1 (7-34)-Ala-Phe-Ala
will also be cleaved by trypsin to yield GLP1 (7-34)
core and the leaving unit Ala-Phe-Ala. The trypsin
cleavage of the multicopy polypeptide is conducted in
the presence of a nucleophilic addition unit such as
Gly-Arg-NH2 so that the final product is a GLP1 (7-36)-NH2
modified recombinant polypeptide as a result of trypsin
catalyzed transpeptidation.

EXAMPLE 4

Preparation of Amidated Recombinant Growth Hormone
Releasing Factor (GRF) (1-44)-NH2 From a
Fusion Protein Construct

A modified recombinant growth hormone releasing
factor can be prepared by cleavage and transpeptidation
of a recombinant multicopy fusion protein. The native
or naturally occurring sequence of growth hormone
releasing factor (SEQ ID NO:16) is:

Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys-

Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln-
13
Asp-Ile-Met-Ser-Arg-Gln-Gln-Gly-Glu-Ser-Asn-Gln-
25
Glu-Arg-Gly-Ala-Arg-Ala-Arg-Leu-NH2
37                          44

A recombinantly produced growth hormone
releasing factor (GRF) is not produced in the highly
active amidated form and an additional step using an
α-amidating enzyme is typically necessary. However, a
strategy can be designed to form the amidated GRF by
combining cleavage of a recombinant single copy fusion
protein with transpeptidation.

A DNA construct encoding a single copy fusion
protein can be formed as described in Example 1.
Briefly, the gene for human carbonic anhydrase is
subcloned into an E. coli expression vector such as
pB0304, as described in U.S. application Serial No.
07/552,810. The DNA sequence encoding an
interconnecting peptide of the following sequence (SEQ
ID NO:9):

Asn-Gly-Pro-Arg

is synthesized by automated DNA synthesis. A DNA
sequence encoding a truncated core GRF polypeptide and
the leaving unit -Ala-, for example GRF
(1-41)-Ala-Arg-Leu-Ala, having the following sequence
(SEQ ID NO:17):

Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys-

Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln-
13
Asp-Ile-Met-Ser-Arg-Gln-Gln-Gly-Glu-Ser-Asn-Gln-
25
Glu-Arg-Gly-Ala-Arg-Ala-Arg-Leu-Ala
37              41              45
is synthesized by automated DNA synthetic methods. The
terminal Ala residue is added because it serves as a
good leaving unit for the cleavage and transpeptidation
reaction. The DNA sequence for the interconnecting
peptide and the truncated modified GRF (1-41)-Ala (SEQ
ID NO:17) peptide can be synthesized together as a
single sequence or separately and then subcloned
immediately downstream from the gene for human carbonic
anhydrase to form the expression vector for the fusion
protein. The expression vector is then transformed into
E. coli and transformants are selected and amplified.
The fusion protein is isolated and purified from cell
lysates using affinity chromatography as described in
Example 1.

Once purified, the human carbonic anhydrase
fusion protein is digested with 2 M NH2OH and 5 M
guanidine hydrochloride to release the GRF
(1-41)-Ala-Arg-Leu-Ala peptide from the fusion protein. The
cleaved GRF (1-41)-Ala-Arg-Leu-Ala peptide can be
separated from human carbonic anhydrase by normal
chromatographic methods, e.g., ion exchange, reverse
phase and size exclusion. Alternatively, the peptide
can be separated from the carrier protein by dilution of
the reaction mixture with water and acetic acid so that
the concentration of acetic acid is 10%
volume/volume (v/v). The addition of 5.6 g/100 ml sodium
sulfate (Na2SO4) to this mixture results in a precipitate
which can be removed by centrifugation at 10,000 x g for
10 minutes. The human carbonic anhydrase is selectively
precipitated from the reaction mixture. The supernatant
is applied to an open C-8 column which is rinsed with
four column volumes of 10% acetic acid and the peptide
is eluted from the column with 50% acetonitrile in 10%
acetic acid. The peptide is then freeze dried.

For cleavage and transpeptidation, the purified
GRF (1-41)-Ala peptide is then cleaved with thrombin in
the presence of either Ala-Arg-Leu-NH2 or Ala-Arg-Leu-Gly
(SEQ ID NO:8). The purified GRF (1-41)-Ala is dissolved
at 10 mg/ml in a buffer at pH 5-11 with 0.01 to 1 M
Ala-Arg-Leu-NH2 or Ala-Arg-Leu-Gly (SEQ ID NO:8) which
contains thrombin at a 1:3000 ratio (thrombin:peptide).
It has been discovered that the GAR sequence at residues
39-41 in the GRF (1-41) peptide (SEQ ID NO:7) is a site
recognized and cleaved by thrombin. The thrombin
cleaves the Ala from the carboxyl terminus and forms an
acyl-enzyme intermediate. The Ala-Arg-Leu-NH2 or
Ala-Arg-Leu-Gly (SEQ ID NO:8) acts as a nucleophile and
transpeptidation occurs as follows:

Reaction 1:

                                                 Thrombin
GRF (1-41)-Ala-Arg-Leu-Ala + Ala-Arg-Leu-NH2   ---------->
                                                pH = 5-11

GRF (1-41)-Ala-Arg-Leu-NH2 + Ala-Arg-Leu-Ala

Reaction 2:

                                                 Thrombin
GRF (1-41)-Ala-Arg-Leu-Ala + Ala-Arg-Leu-Gly   ---------->
                                                pH = 5-11

GRF (1-41)-Ala-Arg-Leu-Gly + Ala-Arg-Leu-Ala.
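The position of the GAR site and the identity of the Reaction 1 product can be checked at the sequence level. This is a minimal sketch, assuming three-letter residue lists; `find_motif` is a hypothetical helper, and the cleavage point (after residue 41) is taken from the text rather than predicted from thrombin specificity.

```python
# Native GRF (1-44) in three-letter codes (SEQ ID NO:16).
NATIVE_GRF = (
    "Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln "
    "Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly "
    "Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu"
).split()

# GRF (1-41)-Ala-Arg-Leu-Ala, the substrate of Reactions 1 and 2 (SEQ ID NO:17).
SUBSTRATE = NATIVE_GRF[:41] + ["Ala", "Arg", "Leu", "Ala"]


def find_motif(seq, motif):
    """Return the 1-based start positions at which motif occurs in seq."""
    m = len(motif)
    return [i + 1 for i in range(len(seq) - m + 1) if seq[i:i + m] == list(motif)]


gar_sites = find_motif(SUBSTRATE, ["Gly", "Ala", "Arg"])
print(gar_sites)                       # start position(s) of Gly-Ala-Arg

cut = gar_sites[0] + 2                 # last residue of the GAR site
core = SUBSTRATE[:cut]                 # GRF (1-41), retained by the enzyme
released = SUBSTRATE[cut:]             # Ala-Arg-Leu-Ala, the released unit
product = core + ["Ala", "Arg", "Leu"] # addition unit Ala-Arg-Leu-NH2

print("-".join(released))
print(product == NATIVE_GRF)
```

In this bookkeeping the Reaction 1 product has the same residue order as native GRF (1-44); the C-terminal amide comes from the addition unit and is not represented in the list.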

The final product of reaction 1 corresponds to
the amidated native GRF (1-44)-NH2. The final product of
reaction 2 corresponds to GRF (1-44)-Gly. The GRF
(1-44)-Gly can be converted to the amide by later reaction
using a C-terminal α-amidating enzyme.

EXAMPLE 5

Preparation of Amidated GRF (1-44)-NH2 From a
Recombinant Multicopy Polypeptide

Amidated recombinant GRF (1-44)-NH2 can be
prepared from a recombinant multicopy polypeptide by
cleavage and transpeptidation.

The recombinant multicopy peptide is produced
by cells transformed with an expression vector. A DNA
construct is formed by joining four copies of the coding
sequence for a truncated GRF (1-41) joined end to end
and having a terminal DNA sequence encoding a modified
truncated GRF (1-41)-Ala peptide. This DNA construct is
formed by automated DNA synthesis and subcloned into an
E. coli expression vector such as pB0304. The expression
vector is then transformed into E. coli and
transformants are selected and then amplified. The
multicopy polypeptide is isolated from cell lysates as
described in Example 3.
Once purified, the multicopy polypeptide is
cleaved and transpeptidated with thrombin. Thrombin
cleaves after the GAR sequences of residues 39-41 in the
GRF (1-41) peptide to yield single copies of truncated GRF
(1-41) and a modified truncated GRF (1-41)-Ala. The
modified truncated GRF (1-41)-Ala is also cleaved by
thrombin to yield GRF (1-41) and alanine. The cleavage
with thrombin is conducted in the presence of
Ala-Arg-Leu-NH2. The Ala-Arg-Leu-NH2 acts as a nucleophile,
resulting in transpeptidation as follows:

GRF (1-41) and
                                                 Thrombin
GRF (1-41)-Ala-Arg-Leu-Ala + Ala-Arg-Leu-NH2   ---------->
                                                pH = 5-11

GRF (1-44)-NH2 + Ala-Arg-Leu-Ala

The final product is amidated native GRF (1-44)-NH2.

All publications and patent applications in
this specification are indicative of the level of
ordinary skill in the art to which this invention
pertains. All publications and patent applications are
herein incorporated by reference to the same extent as
if each individual publication or patent application was
specifically and individually indicated to be
incorporated by reference.

It will be apparent to one of ordinary skill in
the art that many changes and modifications can be made
in the invention without departing from the spirit or
scope of the appended claims.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: BioNebraska, Inc.

(B) STREET: 3820 NW 46th Street 

(C) CITY: Lincoln 

(D) STATE: NE

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 68524 

(G) TELEPHONE: 402-470-2100 

(H) TELEFAX: 402-470-2345 

(ii) TITLE OF INVENTION: Enzymatic Method for Modification of
Recombinant Polypeptides

(iii) NUMBER OF SEQUENCES: 26

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(vi) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: US 08/095,162

(B) FILING DATE: 20-JUL-1993

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (7-36)-NH2 (Glucagon-like Peptide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1               5                   10                  15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg
            20                  25                  30



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (7-34)-Ala-Phe-Ala

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1               5                   10                  15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Ala Phe Ala
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (7-36)-Gly

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1               5                   10                  15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly
            20                  25                  30



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (7-34)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1               5                   10                  15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys
            20                  25

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

His Asp Glu Phe Glu Arg His
1               5

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (1-34)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val
1               5                   10                  15

Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu
            20                  25                  30

Val Lys

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vii) IMMEDIATE SOURCE:

(B) CLONE: GRF (1-41) (Growth Hormone Releasing Factor)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1               5                   10                  15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
            20                  25                  30

Glu Ser Asn Gln Glu Arg Gly Ala Arg
        35                  40



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Ala Arg Leu Gly
1

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asn Gly Pro Arg 
1 



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Enterokinase cleavage enzyme 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GACGACGACG ATAAA    15

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vii) IMMEDIATE SOURCE:

(B) CLONE: Factor Xa cleavage enzyme

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
ATTCAAGGAA GA    12



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Thrombin cleavage enzyme 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AGAGGACCAA GA    12

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(vii) IMMEDIATE SOURCE:

(B) CLONE: Ubiquitin cleaving enzyme

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
AGAGGAGGA    9



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Renin cleavage enzyme 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
CATCCTTTTC ATCTGCTGGT TTAT    24

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (1-36)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val
1               5                   10                  15

Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu
            20                  25                  30

Val Lys Gly Arg
        35



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GRF (1-44) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1               5                   10                  15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
            20                  25                  30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu
        35                  40

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GRF (1-41)-Ala-Arg-Leu-Ala 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1               5                   10                  15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
            20                  25                  30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu Ala
        35                  40                  45



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1               5                   10                  15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Xaa
            20                  25

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vii) IMMEDIATE SOURCE:

(B) CLONE: GLP1 (1-37)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val
1               5                   10                  15

Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu
            20                  25                  30

Val Lys Gly Arg Gly
        35



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Ala Arg Leu Ala
1

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE:

(B) CLONE: GRF (1-44)-Gly

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1               5                   10                  15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
            20                  25                  30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu Gly
        35                  40                  45



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Arg Ala Arg Leu Ala
1               5

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (1-34)-Ala-Phe-Ala

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val
1               5                   10                  15

Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu
            20                  25                  30

Val Lys Ala Phe Ala
        35



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GRF (1-41)-Ala 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1               5                   10                  15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
            20                  25                  30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala
        35                  40

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vii) IMMEDIATE SOURCE:

(B) CLONE: GRF (1-41)-Ala-Arg-Leu

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1               5                   10                  15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
            20                  25                  30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu
        35                  40



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GRF (1-41)-Ala-Arg-Leu-Gly



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
20 25 30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu Gly
35 40 45 
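The stated LENGTH fields in the listing above can be checked against the number of residues actually listed. The following is a minimal sketch and not part of the original filing: the names `GRF_1_41`, `LISTED` and `count_residues` are hypothetical, and the residues are transcribed from SEQ ID NOS: 22 to 26 with the three-letter codes for isoleucine and glutamine read as Ile and Gln.

```python
# Residues transcribed from the sequence listing (three-letter codes).
# GRF (1-41) is the shared prefix of SEQ ID NOS: 24, 25 and 26.
GRF_1_41 = (
    "Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln "
    "Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly "
    "Glu Ser Asn Gln Glu Arg Gly Ala Arg"
)

# SEQ ID NO -> (listed residues, LENGTH stated in the listing)
LISTED = {
    22: ("Arg Ala Arg Leu Ala", 5),
    23: (
        "His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val "
        "Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu "
        "Val Lys Ala Phe Ala",
        37,
    ),
    24: (GRF_1_41 + " Ala", 42),
    25: (GRF_1_41 + " Ala Arg Leu", 44),
    26: (GRF_1_41 + " Ala Arg Leu Gly", 45),
}


def count_residues(sequence: str) -> int:
    """Count whitespace-separated three-letter residue codes."""
    return len(sequence.split())


for seq_id, (sequence, stated_length) in LISTED.items():
    status = "matches" if count_residues(sequence) == stated_length else "differs"
    print(f"SEQ ID NO:{seq_id}: stated length {status}")
```

A check of this kind only confirms that the residue count agrees with the LENGTH field; it says nothing about whether an individual residue was transcribed correctly.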
WHAT IS CLAIMED IS:

1. A process for modifying a polypeptide by
transpeptidation comprising: contacting together an
addition unit, an endopeptidase enzyme specific for
an enzyme cleavage site, and the recombinant
polypeptide of at least one leaving unit and a core,
wherein the polypeptide is a recombinant polypeptide
and the leaving unit is linked to the core by the
enzyme cleavage site recognized by the endopeptidase
enzyme, to produce a modified recombinant
polypeptide having the addition unit attached to the
core and substituted for the leaving unit.

2. The process according to claim 1, wherein the 
endopeptidase enzyme is a serine or cysteine 
peptidase. 

3. The process according to claim 1, wherein the 
endopeptidase enzyme is selected from the group 
consisting of trypsin, thrombin, chymotrypsin, 
enterokinase, subtilisin, ficin, papain, and factor
Xa. 

4. The process according to claim 1, wherein the core 
is a truncated version of a natural polypeptide. 

5. The process according to claim 1, wherein the core 
is GLP1 (7-34) (SEQ ID NO:4), GLP1 (1-34) (SEQ ID
NO:6), or GRF (1-41) (SEQ ID NO:7).

6. The process according to claim 1, wherein the 
leaving unit of the recombinant polypeptide 
comprises one or more amino acid residues. 

7. The process according to claim 1, wherein the 
addition unit of the modified recombinant 
polypeptide comprises one or more amino acid 
residues which may or may not be altered at the C-
terminal α-carboxy.

8. The process according to claim 1, wherein for the 
endopeptidase cleavage enzyme trypsin, the addition 
unit is comprised of Gly-Arg-NH2 and Gly-Arg-Gly.

9. The process according to claim 1, wherein for the 
endopeptidase cleavage enzyme thrombin, the addition 
unit is Ala-Arg-Leu-NH2 and Ala-Arg-Leu-Gly (SEQ ID
NO:8).

10. The process according to claim 1, wherein the 
endopeptidase enzyme trypsin cleaves at a cleavage
site at the carboxy side of -Lys- and -Arg- bonds. 

11. The process according to claim 1, wherein the -Lys- 
moiety is a poor substrate for the endopeptidase 
trypsin when immediately adjacent to an amino acid 
having a carboxylic acid containing side chain. 

12. The process according to claim 1, wherein the 
endopeptidase enzyme thrombin operates on a cleavage 
site for -Arg- within a recognition sequence 
Gly-Pro-Arg. 

13. The process according to claim 1, wherein the 
endopeptidase enzyme thrombin operates on a cleavage 
site for -Arg- within a recognition sequence 
Gly-Ala-Arg. 

14. The process according to claim 1, wherein the 
transpeptidation reaction occurs in a buffered 
solution at a pH of about 5 to about 11. 

15. The process according to claim 1, wherein the 
transpeptidation reaction occurs with an 
endopeptidase cleavage enzyme: recombinant
polypeptide molar ratio of about 1:1000 to 1:10,000. 

16. The process according to claim 1, wherein the 
transpeptidation reaction utilizing the trypsin 
endopeptidase occurs with a cleavage
enzyme: recombinant polypeptide molar ratio of about
1:1000 to 1:5000. 

17. The process according to claim 1, wherein the 
transpeptidation reaction utilizing the enzyme 
thrombin occurs with a cleavage enzyme: recombinant 
polypeptide molar ratio of about 1:1000 to 1:10,000. 

18. A process for cleavage of a polypeptide at the 
carboxy terminus of the amino acid -Arg-, wherein 
the endopeptidase enzyme thrombin cleaves at the 
carboxy terminus of the amino acid -Arg- within a 
cleavage recognition sequence of Gly-Ala-Arg. 

19. A process for modifying a recombinant polypeptide by 
transpeptidation comprising: 

(a) forming a recombinant polypeptide of a core 
and at least one leaving unit wherein the leaving
unit is linked to the core by an enzyme cleavage
site; and 

(b) contacting an addition unit and the 
recombinant polypeptide with an endopeptidase enzyme 
specific for the cleavage site to produce a modified 
recombinant polypeptide having the addition unit 
attached to the core and substituted for the leaving 
unit. 

20. A process for modifying a recombinant polypeptide by 
transpeptidation comprising: contacting together 
addition units, an endopeptidase enzyme specific for 
an enzyme cleavage site, a multicopy recombinant 
polypeptide of two or more cores, and a leaving unit
linked to a terminal core, wherein each link is an 
enzyme cleavage site recognized by the endopeptidase 
enzyme, and transpeptidation occurs simultaneously 
with cleavage of the multicopy recombinant 
polypeptide into individual core units each with an 
addition unit linked to its downstream end to 
produce a modified recombinant polypeptide product.

21. The process according to claim 20, wherein the core 
units are linked through an intraconnecting peptide 
by cleavage sites operated on by the endopeptidase 
enzyme.

22. A polypeptide having an amino acid sequence (SEQ ID 
NO:18):

His-Ala-Glu-Gly-Thr-Phe-Thr-
 7

Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-

Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-
                26  27

Ala-Trp-Leu-Val-Lys-X
                34

wherein X is selected from the group consisting of 

(a) Gly-Arg-NH2;

(b) Gly-Arg-Gly; and 

(c) Gly-Arg-Gly-NH2, 

which is produced by the process according to claim 
1. 

23. A polypeptide comprising multiple copies of 
contiguously linked GLP1 (1-34) (SEQ ID NO:6),
wherein the terminal copy is linked to a leaving 
unit. 
24. A polypeptide comprising multiple copies of
contiguously linked GLP1 (7-34) (SEQ ID NO:4),
wherein the terminal copy is linked to a leaving 
unit. 

25. A polypeptide comprising multiple copies of 
contiguously linked GRF (1-41) (SEQ ID NO:7), wherein
the terminal copy is linked to a leaving unit. 

26. An expression vector containing a DNA sequence 
coding for a polypeptide of at least one leaving 
unit and a core wherein the leaving unit is linked 
to the core by an enzyme cleavage site which is 
recognized by an endopeptidase enzyme, said enzyme 
being capable of causing the substitution of the 
addition unit for the leaving unit. 

27. A recombinant gene containing a DNA sequence coding 
for a polypeptide of at least one leaving unit and a 
core wherein the leaving unit is linked to the core 
by an enzyme cleavage site which is recognized by an 
endopeptidase enzyme, the enzyme being capable of 
causing the substitution of an addition unit for the 
leaving unit. 

28. A transformed cell expressing the recombinant gene 
containing a DNA sequence coding for a polypeptide 
of at least one leaving unit and a core wherein the 
leaving unit is linked to the core by an enzyme 
cleavage site which is recognized by an 
endopeptidase enzyme, said enzyme being capable of 
causing the substitution of an addition unit for the 
leaving unit. 



29. The transformed cell according to claim 28, wherein 
the transformed cell comprises a prokaryotic cell.
30. The transformed cell according to claim 29, wherein
the prokaryotic cell is an E. coli.

31. The transformed cell according to claim 28, wherein 
the transformed cell comprises a eukaryotic cell. 

32. A pharmaceutical composition of an effective amount
of GRF (1-44)-NH2 (SEQ ID NO: 16) in a physiologically
acceptable carrier to treat short stature syndrome,
endometriosis and osteoporosis, wherein the
GRF (1-44)-NH2 (SEQ ID NO: 16) is produced by the
process according to claim 1. 

33. A pharmaceutical composition of an effective amount 
of GLP1 (7-36)-NH2 (SEQ ID NO:1) in a physiologically
acceptable carrier to treat diabetes mellitus type
II, wherein the GLP1 (7-36)-NH2 is produced by the
process according to claim 1. 

34. A pharmaceutical composition of an effective amount 
of GLP1 (7-36)-Gly (SEQ ID NO:3) in a physiologically
acceptable carrier to treat diabetes mellitus type
II, wherein the GLP1 (7-36)-Gly (SEQ ID NO: 3) is
produced by the process according to claim 1. 

35. A pharmaceutical composition of an effective amount 
of GLP1 (7-36)-Gly-NH2 (SEQ ID NO: 3) in a
physiologically acceptable carrier to treat diabetes
mellitus type II, wherein the GLP1 (7-36)-Gly-NH2
(SEQ ID NO: 3) is produced by the process according 
to claim 1. 

36. A method for treating diabetes mellitus type II 
comprising: administering an effective amount of the 
modified recombinant polypeptide GRF (1-44)-NH2 (SEQ
ID NO: 16) produced by the process of claim 1.
37. A method for treating a human or animal comprising:
administering an effective amount of the modified
recombinant polypeptide GLP1 (7-36)-NH2 (SEQ ID NO:1)
produced by the process of claim 1. 

38. A process for modifying a recombinant polypeptide by 
sequential addition by: 

(a) contacting together an endopeptidase 
enzyme specific for an enzyme cleavage site,
and a recombinant polypeptide of at least one 
leaving unit and a core, forming the hydrolyzed 
product; and

(b) contacting the hydrolyzed product with an 
endopeptidase enzyme specific for an enzyme cleavage 
site, and an addition unit to produce a modified 
recombinant polypeptide having the addition unit 
attached to the core and substituted at the hydroxy 
group. 
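
The substitution recited in claim 1, together with the trypsin site rules of claims 10 and 11, can be illustrated in silico. The following is a minimal sketch under stated assumptions and is not the procedure of the specification: the function names `trypsin_sites` and `transpeptidate` are hypothetical, "adjacent" in claim 11 is assumed to mean either neighbouring residue, the C-terminal amide is represented by the token `Arg-NH2`, and the input is the GLP1 (1-34)-Ala-Phe-Ala construct of SEQ ID NO:23.

```python
# Residues with a carboxylic acid side chain (claim 11).
ACIDIC = {"Asp", "Glu"}

# SEQ ID NO:23, GLP1 (1-34)-Ala-Phe-Ala, as a list of three-letter codes.
SEQ_23 = (
    "His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val "
    "Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu "
    "Val Lys Ala Phe Ala"
).split()


def trypsin_sites(residues):
    """Return 1-based positions of Lys/Arg treated as candidate sites.

    The scissile bond is on the carboxy side of the returned residue
    (claim 10). A Lys with an acidic neighbour on either side is skipped
    as a poor substrate (claim 11, under the assumption stated above).
    """
    sites = []
    for i, res in enumerate(residues):
        if res == "Arg":
            sites.append(i + 1)
        elif res == "Lys":
            neighbours = residues[max(i - 1, 0):i] + residues[i + 1:i + 2]
            if not ACIDIC.intersection(neighbours):
                sites.append(i + 1)
    return sites


def transpeptidate(residues, site, addition_unit):
    """Replace the leaving unit after `site` with the addition unit."""
    return residues[:site] + list(addition_unit)


sites = trypsin_sites(SEQ_23)
# Substitute the Ala-Phe-Ala leaving unit with Gly-Arg-NH2 (claim 8)
# at the last candidate site.
product = transpeptidate(SEQ_23, sites[-1], ["Gly", "Arg-NH2"])
print(sites)
print("-".join(product))
```

For this input the sketch flags Arg6 and Lys34 and skips Lys26 because of the adjacent Glu27, so the leaving unit Ala-Phe-Ala after Lys34 is the part replaced by Gly-Arg-NH2. Real site preference depends on reaction conditions that this model does not capture.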